Noise removal feature enhancement and speech recognition techniques for artificial larynx transducer speech

Inbanila, K

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/426655

Title:	Noise removal feature enhancement and speech recognition techniques for artificial larynx transducer speech
Researcher:	Inbanila, K
Guide(s):	Kumar, E Krishna
Keywords:	Acoustics sound vibration Engineering Engineering and Technology
University:	CHRIST University
Completed Date:	2019
Abstract:	Speech impediments are the state of difficulty for a person to speak comfortably. These impediments make the spoken speech distorted and they are generally categorized as disordered speech. The quality of disordered speech is poor as clarity, intelligibility and naturalness is missing. In most type of disordered speech the voice is natural and produced by the vocal system of the human being. The vocal system includes the organ called as Larynx placed in the upper part of the neck. This organ has the vocal folds that contribute for pitch variation and volume of the speech. This organ will be malfunctioning some time or will be removed because of cancer. In both the case in order to restore speech, an external device called Artificial Larynx Transducer (ALT) is used to produce the sound. It is a small handheld battery operated device and is used for decades to obtain the audible speech for people who lost their speech because of removal of larynx. The quality of speech and its intelligibility of AL speakers have not improved for decades. The reason for poor quality is constant vibration of ALT, direct sound from ALT and pressure offered to produce the vibration. newlineSo in this research the nature of the speech produced from ALT is analyzed, a possible enhancement of the parameter is done and a recognition technique of the spoken word with the help of trained data is done. Here the approach followed to tackle the problem of poor quality in AL speech involves both speech enhancement and recognizer technique development. When it is looked as enhancement problem noise region localization, noise estimation and noise suppression methods were adopted. In the process of parameter enhancement, pitch frequency estimation and improvement is implemented. When it is looked as recognition problem the parameters pitch frequency, formant frequency, glottal excitation, spectral tilt, coefficients are extracted. As formant frequency is a sensitive parameter, its estimation was done using Recurrent Neural network.
Pagination:	xviii, 150p.;
URI:	http://hdl.handle.net/10603/426655
Appears in Departments:	Department of Electronics and Communication Engineering

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	1.15 MB	Adobe PDF	View/Open
02_prelim pages.pdf		478.71 kB	Adobe PDF	View/Open
03_abstract.pdf		15.86 kB	Adobe PDF	View/Open
04_table_of_contents.pdf		10.26 kB	Adobe PDF	View/Open
05_chapter1.pdf		67.15 kB	Adobe PDF	View/Open
06_chapter2.pdf		471.43 kB	Adobe PDF	View/Open
07_chapter3.pdf		383.59 kB	Adobe PDF	View/Open
08_chapter4.pdf		369.76 kB	Adobe PDF	View/Open
09_chapter5.pdf		265.31 kB	Adobe PDF	View/Open
10_chapter6.pdf		336.24 kB	Adobe PDF	View/Open
11_chapter7.pdf		10.1 kB	Adobe PDF	View/Open
12_annexures.pdf		89.41 kB	Adobe PDF	View/Open
80_recommendation.pdf		1.16 MB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET