Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/333495
Title: | Monaural and binaural cues based auditory scene analyzer |
Researcher: | Venkatesan R |
Guide(s): | Balaji Ganesh A |
Keywords: | Engineering and Technology Engineering Engineering Electrical and Electronic Auditory Scene Analysis Binaural Speech Segregation Monaural Binaural Cues |
University: | Anna University |
Completed Date: | 2020 |
Abstract: | The quality of speech signals are highly influenced by the background noises and also room reverberation present in real world environments The human auditory system shows very sophisticated capabilities to analyze complex acoustic mixtures especially in multi talker reverberant environments The Computational Auditory Scene Analysis newline CASA includes the designing of machine hearing systems that utilizes the principles of human auditory system The work discusses both binaural speech segregation and also sound localization in different azimuth as well as distance for artificial listening devices It also focuses on separating the desired target speech from the binaural sound mixtures as a front end processing in cock tail party environment The binaural cues such as Interaural Level Difference ILD Interaural Time Difference ITD and Interaural Coherence IC are extracted from auditory front end processing A reliable soft Time Frequency T F mask is generated by using joint acoustic features such as monaural and binaural cues The concatenated spectral and spatial cues are successfully incorporated into LSTM DRNNs based binaural speech segregation classification framework Also the work considers joint approach of soft time frequency masking functions and discriminative objective learning which are promoted as a deterministic built in layer in a recurrent architecture that helps to improve the speech intelligibility and evaluation measures The performance analysis of different deep learning architectures with several aspects including Deep Neural Networks DNN DRNN with and without joint masking DRNN with and without discriminative objective functions have been carried out by using evaluation metrics such as Source to Interference Ratio SIR Source to Distortion Ratio SDR and Source to Artifacts Ratio SAR newline newline |
Pagination: | xxvi, 207p. |
URI: | http://hdl.handle.net/10603/333495 |
Appears in Departments: | Faculty of Electrical Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 19.74 kB | Adobe PDF | View/Open |
02_certificates.pdf | 566.7 kB | Adobe PDF | View/Open | |
03_abstracts.pdf | 10.57 kB | Adobe PDF | View/Open | |
04_acknowledgements.pdf | 171 kB | Adobe PDF | View/Open | |
05_contents.pdf | 15.85 kB | Adobe PDF | View/Open | |
06_listoftables.pdf | 8.85 kB | Adobe PDF | View/Open | |
07_listoffigures.pdf | 14.26 kB | Adobe PDF | View/Open | |
08_listofabbreviations.pdf | 11.39 kB | Adobe PDF | View/Open | |
09_chapter1.pdf | 194.52 kB | Adobe PDF | View/Open | |
10_chapter2.pdf | 112.17 kB | Adobe PDF | View/Open | |
11_chapter3.pdf | 169.84 kB | Adobe PDF | View/Open | |
12_chapter4.pdf | 855.22 kB | Adobe PDF | View/Open | |
13_chapter5.pdf | 1.21 MB | Adobe PDF | View/Open | |
14_chapter6.pdf | 505.54 kB | Adobe PDF | View/Open | |
15_conclusion.pdf | 30.21 kB | Adobe PDF | View/Open | |
16_appendices.pdf | 22.15 kB | Adobe PDF | View/Open | |
17_references.pdf | 78.09 kB | Adobe PDF | View/Open | |
18_listofpublications.pdf | 17.86 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 54.75 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: