Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/426655
Title: Noise removal feature enhancement and speech recognition techniques for artificial larynx transducer speech
Researcher: Inbanila, K
Guide(s): Kumar, E Krishna
Keywords: Acoustics sound vibration
Engineering
Engineering and Technology
University: CHRIST University
Completed Date: 2019
Abstract: Speech impediments are the state of difficulty for a person to speak comfortably. These impediments make the spoken speech distorted and they are generally categorized as disordered speech. The quality of disordered speech is poor as clarity, intelligibility and naturalness is missing. In most type of disordered speech the voice is natural and produced by the vocal system of the human being. The vocal system includes the organ called as Larynx placed in the upper part of the neck. This organ has the vocal folds that contribute for pitch variation and volume of the speech. This organ will be malfunctioning some time or will be removed because of cancer. In both the case in order to restore speech, an external device called Artificial Larynx Transducer (ALT) is used to produce the sound. It is a small handheld battery operated device and is used for decades to obtain the audible speech for people who lost their speech because of removal of larynx. The quality of speech and its intelligibility of AL speakers have not improved for decades. The reason for poor quality is constant vibration of ALT, direct sound from ALT and pressure offered to produce the vibration. newlineSo in this research the nature of the speech produced from ALT is analyzed, a possible enhancement of the parameter is done and a recognition technique of the spoken word with the help of trained data is done. Here the approach followed to tackle the problem of poor quality in AL speech involves both speech enhancement and recognizer technique development. When it is looked as enhancement problem noise region localization, noise estimation and noise suppression methods were adopted. In the process of parameter enhancement, pitch frequency estimation and improvement is implemented. When it is looked as recognition problem the parameters pitch frequency, formant frequency, glottal excitation, spectral tilt, coefficients are extracted. As formant frequency is a sensitive parameter, its estimation was done using Recurrent Neural network.
Pagination: xviii, 150p.;
URI: http://hdl.handle.net/10603/426655
Appears in Departments:Department of Electronics and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File1.15 MBAdobe PDFView/Open
02_prelim pages.pdf478.71 kBAdobe PDFView/Open
03_abstract.pdf15.86 kBAdobe PDFView/Open
04_table_of_contents.pdf10.26 kBAdobe PDFView/Open
05_chapter1.pdf67.15 kBAdobe PDFView/Open
06_chapter2.pdf471.43 kBAdobe PDFView/Open
07_chapter3.pdf383.59 kBAdobe PDFView/Open
08_chapter4.pdf369.76 kBAdobe PDFView/Open
09_chapter5.pdf265.31 kBAdobe PDFView/Open
10_chapter6.pdf336.24 kBAdobe PDFView/Open
11_chapter7.pdf10.1 kBAdobe PDFView/Open
12_annexures.pdf89.41 kBAdobe PDFView/Open
80_recommendation.pdf1.16 MBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: