Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/333495
Title: Monaural and binaural cues based auditory scene analyzer
Researcher: Venkatesan R
Guide(s): Balaji Ganesh A
Keywords: Engineering and Technology
Engineering
Engineering Electrical and Electronic
Auditory Scene Analysis
Binaural Speech Segregation
Monaural
Binaural Cues
University: Anna University
Completed Date: 2020
Abstract: The quality of speech signals are highly influenced by the background noises and also room reverberation present in real world environments The human auditory system shows very sophisticated capabilities to analyze complex acoustic mixtures especially in multi talker reverberant environments The Computational Auditory Scene Analysis newline CASA includes the designing of machine hearing systems that utilizes the principles of human auditory system The work discusses both binaural speech segregation and also sound localization in different azimuth as well as distance for artificial listening devices It also focuses on separating the desired target speech from the binaural sound mixtures as a front end processing in cock tail party environment The binaural cues such as Interaural Level Difference ILD Interaural Time Difference ITD and Interaural Coherence IC are extracted from auditory front end processing A reliable soft Time Frequency T F mask is generated by using joint acoustic features such as monaural and binaural cues The concatenated spectral and spatial cues are successfully incorporated into LSTM DRNNs based binaural speech segregation classification framework Also the work considers joint approach of soft time frequency masking functions and discriminative objective learning which are promoted as a deterministic built in layer in a recurrent architecture that helps to improve the speech intelligibility and evaluation measures The performance analysis of different deep learning architectures with several aspects including Deep Neural Networks DNN DRNN with and without joint masking DRNN with and without discriminative objective functions have been carried out by using evaluation metrics such as Source to Interference Ratio SIR Source to Distortion Ratio SDR and Source to Artifacts Ratio SAR newline newline
Pagination: xxvi, 207p.
URI: http://hdl.handle.net/10603/333495
Appears in Departments:Faculty of Electrical Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File19.74 kBAdobe PDFView/Open
02_certificates.pdf566.7 kBAdobe PDFView/Open
03_abstracts.pdf10.57 kBAdobe PDFView/Open
04_acknowledgements.pdf171 kBAdobe PDFView/Open
05_contents.pdf15.85 kBAdobe PDFView/Open
06_listoftables.pdf8.85 kBAdobe PDFView/Open
07_listoffigures.pdf14.26 kBAdobe PDFView/Open
08_listofabbreviations.pdf11.39 kBAdobe PDFView/Open
09_chapter1.pdf194.52 kBAdobe PDFView/Open
10_chapter2.pdf112.17 kBAdobe PDFView/Open
11_chapter3.pdf169.84 kBAdobe PDFView/Open
12_chapter4.pdf855.22 kBAdobe PDFView/Open
13_chapter5.pdf1.21 MBAdobe PDFView/Open
14_chapter6.pdf505.54 kBAdobe PDFView/Open
15_conclusion.pdf30.21 kBAdobe PDFView/Open
16_appendices.pdf22.15 kBAdobe PDFView/Open
17_references.pdf78.09 kBAdobe PDFView/Open
18_listofpublications.pdf17.86 kBAdobe PDFView/Open
80_recommendation.pdf54.75 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: