Monaural and binaural cues based auditory scene analyzer

Venkatesan R

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/333495

Title:	Monaural and binaural cues based auditory scene analyzer
Researcher:	Venkatesan R
Guide(s):	Balaji Ganesh A
Keywords:	Engineering and Technology; Engineering; Engineering Electrical and Electronic; Auditory Scene Analysis; Binaural Speech Segregation; Monaural Binaural Cues
University:	Anna University
Completed Date:	2020
Abstract:	The quality of speech signals is strongly affected by the background noise and room reverberation present in real-world environments. The human auditory system shows sophisticated capabilities for analyzing complex acoustic mixtures, especially in multi-talker reverberant environments. Computational Auditory Scene Analysis (CASA) involves designing machine hearing systems that utilize the principles of the human auditory system. The work discusses both binaural speech segregation and sound localization at different azimuths and distances for artificial listening devices. It also focuses on separating the desired target speech from binaural sound mixtures as front-end processing in a cocktail-party environment. Binaural cues such as Interaural Level Difference (ILD), Interaural Time Difference (ITD), and Interaural Coherence (IC) are extracted by the auditory front-end processing. A reliable soft time-frequency (T-F) mask is generated using joint acoustic features, namely monaural and binaural cues. The concatenated spectral and spatial cues are incorporated into an LSTM-DRNN-based binaural speech segregation classification framework. The work also considers a joint approach in which soft time-frequency masking functions and discriminative objective learning are built into a recurrent architecture as a deterministic layer, which helps improve speech intelligibility and the evaluation measures. The performance of different deep learning architectures, including Deep Neural Networks (DNN), DRNN with and without joint masking, and DRNN with and without discriminative objective functions, is analyzed using evaluation metrics such as Source-to-Interference Ratio (SIR), Source-to-Distortion Ratio (SDR), and Source-to-Artifacts Ratio (SAR).
Pagination:	xxvi, 207p.
URI:	http://hdl.handle.net/10603/333495
Appears in Departments:	Faculty of Electrical Engineering
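
The three binaural cues named in the abstract (ILD, ITD, IC) can be illustrated with a minimal sketch. This is not the thesis's implementation: the function name `binaural_cues`, the 1 ms lag limit, and the synthetic test signal are assumptions made for illustration, and the cues are computed on a single broadband frame, whereas a CASA front end would typically compute them per frequency channel and per time frame.

```python
import numpy as np

def binaural_cues(left, right, fs, max_itd_s=1e-3):
    """Return (ild_db, itd_s, ic) for one frame of a two-ear signal.

    Hypothetical helper for illustration only.
    ild_db > 0 means the left ear is louder;
    itd_s  > 0 means the right ear lags the left ear.
    """
    eps = 1e-12  # guards against log(0) and division by zero
    n = len(left)
    e_left = np.sum(left ** 2)
    e_right = np.sum(right ** 2)

    # ILD: energy ratio between the two ear signals, in dB.
    ild_db = 10.0 * np.log10((e_left + eps) / (e_right + eps))

    # Cross-correlation c(k) = sum_n left[n] * right[n + k],
    # evaluated only for lags within the assumed plausible ITD range.
    max_lag = int(round(max_itd_s * fs))
    lags = np.arange(-max_lag, max_lag + 1)
    xcorr = np.array([
        np.sum(left[max(0, -k): n - max(0, k)] *
               right[max(0, k): n - max(0, -k)])
        for k in lags
    ])

    # ITD: lag of the cross-correlation peak, converted to seconds.
    itd_s = lags[np.argmax(xcorr)] / fs

    # IC: peak of the energy-normalised cross-correlation.
    ic = np.max(xcorr) / (np.sqrt(e_left * e_right) + eps)
    return ild_db, itd_s, ic

# Synthetic example: the right ear receives a copy of the left-ear
# noise that is delayed by 8 samples and attenuated by a factor of 2.
fs = 16000
rng = np.random.default_rng(0)
left = rng.standard_normal(1600)
delay = 8
right = 0.5 * np.concatenate([np.zeros(delay), left[:-delay]])

ild_db, itd_s, ic = binaural_cues(left, right, fs)
print(f"ILD = {ild_db:.2f} dB, ITD = {itd_s * 1e3:.3f} ms, IC = {ic:.3f}")
```

For this synthetic source the ILD comes out near 6 dB (an amplitude ratio of 2), the ITD equals the imposed delay of 8 samples, and the IC is close to 1 because the two channels are scaled, shifted copies of each other. Reverberation and competing talkers would lower the IC.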

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	19.74 kB	Adobe PDF
02_certificates.pdf		566.7 kB	Adobe PDF
03_abstracts.pdf		10.57 kB	Adobe PDF
04_acknowledgements.pdf		171 kB	Adobe PDF
05_contents.pdf		15.85 kB	Adobe PDF
06_listoftables.pdf		8.85 kB	Adobe PDF
07_listoffigures.pdf		14.26 kB	Adobe PDF
08_listofabbreviations.pdf		11.39 kB	Adobe PDF
09_chapter1.pdf		194.52 kB	Adobe PDF
10_chapter2.pdf		112.17 kB	Adobe PDF
11_chapter3.pdf		169.84 kB	Adobe PDF
12_chapter4.pdf		855.22 kB	Adobe PDF
13_chapter5.pdf		1.21 MB	Adobe PDF
14_chapter6.pdf		505.54 kB	Adobe PDF
15_conclusion.pdf		30.21 kB	Adobe PDF
16_appendices.pdf		22.15 kB	Adobe PDF
17_references.pdf		78.09 kB	Adobe PDF
18_listofpublications.pdf		17.86 kB	Adobe PDF
80_recommendation.pdf		54.75 kB	Adobe PDF

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Shodhganga : a reservoir of Indian theses @ INFLIBNET