Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/9823
Full metadata record
DC FieldValueLanguage
dc.coverage.spatialInformation and Communicationen_US
dc.date.accessioned2013-07-11T04:51:11Z-
dc.date.available2013-07-11T04:51:11Z-
dc.date.issued2013-07-11-
dc.identifier.urihttp://hdl.handle.net/10603/9823-
dc.description.abstractThis Thesis presents a detailed study on speech enhancement algorithm to provide robustness to the Automatic Speaker Recognizer (ASR) in real-life noisy conditions. The main objective of this work is to attenuate the noise component of a noisy speech in order to enhance the quality of the speech processing devices and make them more robust under noisy conditions using wavelet based algorithms and to carry out a comprehensive evaluation and comparison of their performances on speaker recognition task. In this Thesis two new single channel wavelet based speech enhancement methods and a noise robust automatic speaker recognition are developed and reported.. Firstly, a technique using Time Adaptive Discrete Wavelet Thresholding (TADWT) based on Bionic Wavelet Transform (BWT) is proposed. In this approach discrete BWT is used for speech enhancement task, the adaptive nature of the BWT is captured by introducing a time varying linear factor at each scale over time and modified soft thresholding function is used for denoising. This method also provides a good auditory representation (sufficient frequency resolution), good perceptual quality of speech and low computational load. In this thesis, basic spectral subtraction (SS), iterative Wiener filtering (IWF), Ephraim Malah filtering (EMF), Bionic wavelet based thresholding (BWT) techniques and Perceptual wavelet packet transform (PWPT) have been used as baseline methods for speech enhancement tests. Performance evaluation of proposed methods is made based on segmental signal to noise ratio (SSNR), signal to noise ratio (SNR), Itakura-Saito (IS) distance measure and minimum mean square error (MMSE) for the objective speech quality evaluation. The average recognition accuracy of the system is improved while incorporating SE methods as preprocessor while comparing with the recognition rate obtained for degraded speech.en_US
dc.format.extentxxviii, 177p.en_US
dc.languageEnglishen_US
dc.relationNo. of references 116en_US
dc.rightsuniversityen_US
dc.titleCertain explorations on speech enhancement techniques for automatic speaker recognition in noisy environmenten_US
dc.creator.researcherSumithra M Gen_US
dc.subject.keywordAutomatic Speaker Recognizeren_US
dc.subject.keywordTime adaptive discrete wavelet thresholding-
dc.subject.keywordSpeech enhancement technique-
dc.description.noteNoneen_US
dc.contributor.guideThanushkodi Ken_US
dc.publisher.placeChennaien_US
dc.publisher.universityAnna Universityen_US
dc.publisher.institutionFaculty of Information and Communication Engineeringen_US
dc.date.registered2006en_US
dc.date.completed01/06/2011en_US
dc.date.awarded01/11/2011en_US
dc.format.dimensions--en_US
dc.format.accompanyingmaterialNoneen_US
dc.source.universityUniversityen_US
dc.type.degreePh.D.en_US
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File13.98 kBAdobe PDFView/Open
02_certificates.pdf530.9 kBAdobe PDFView/Open
03_abstract.pdf19.8 kBAdobe PDFView/Open
04_acknowledgement.pdf14.21 kBAdobe PDFView/Open
05_contents.pdf68.16 kBAdobe PDFView/Open
06_chapter 1.pdf86.94 kBAdobe PDFView/Open
07_chapter 2.pdf83.02 kBAdobe PDFView/Open
08_chapter 3.pdf5.51 MBAdobe PDFView/Open
09_chapter 4.pdf7.83 MBAdobe PDFView/Open
10_chapter 5.pdf1.67 MBAdobe PDFView/Open
11_chapter 6.pdf28.15 kBAdobe PDFView/Open
12_references.pdf42.02 kBAdobe PDFView/Open
13_publications.pdf24.51 kBAdobe PDFView/Open
14_vitae.pdf13.18 kBAdobe PDFView/Open


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: