Robust Framework for Speaker Independent Tamil Speech Recognition under Noisy Environments using Modified GFCC Features and Machine Learning Techniques

Vimala C

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/40735

Title:	Robust Framework for Speaker Independent Tamil Speech Recognition under Noisy Environments using Modified GFCC Features and Machine Learning Techniques
Researcher:	Vimala C
Guide(s):	Radha V
Keywords:	Speech Recognition Tamil Language Gammatone Frequency Cochleagram Coefficients Hidden Markov Models Support Vector Machine Multi Taper Windowing Yule Walker Auto Regressive Speech Signal Enhancement
Upload Date:	9-May-2015
University:	Avinashilingam Deemed University For Women
Completed Date:	14/11/2014
Abstract:	The main objective of the research work is to increase the robustness of an Automatic Speech Recognition ASR for Tamil language by introducing an efficient speech front end newlineprocessing techniques The methodology of the proposed work is carried out in three phases In Phase I a framework has been developed with the aid of existing feature extraction newlineand speech recognition techniques for both noise free and noisy data Best techniques have been selected from both types of data based on Word Recognition Rate WRR and Real Time Factor RTF and only the selected techniques are been used in Phase II for achieving further improvements In Phase II the factors affecting the performance of ASR are analyzed and identified The solutions to the identified problems are carefully developed which can be highly suitable for both noise free and noisy environments Five pass pre processing and three modified GFCC features using multi taper Yule Walker AR power spectrum combinational features using formant frequencies combined frequency warping and feature normalizat ion using LPC and Cepstral Mean Normalization CMN are developed The performance improvements of these techniques are assessed initially for noise free data later the robustness of the same proposed techniques are evaluated for different noisy conditions It is proved from the experiments that the proposed techniques are found to be robust and efficient in terms of improving the recognition accuracy for both noise free and noisy conditions In order to increase the performance of noisy speech recognition various speech signal enhancement techniques are implemented in Phase III and they are evaluated using both subjective and objective speech quality measures Based on the outcome the Recursive Least Squares RLS adaptive algorithm is selected and further improved by introducing a reconstruction methodology using Dual Tree Complex Wavelet DTCW Transform Finally the performance of the noisy speech recognition is evaluated before and after applying the RLSDTCW technique
Pagination:
URI:	http://hdl.handle.net/10603/40735
Appears in Departments:	Department of Computer Science

Files in This Item:

File	Description	Size	Format
cvimala_chapter1.pdf	Attached File	470.49 kB	Adobe PDF	View/Open
cvimala_chapter2.pdf		344.49 kB	Adobe PDF	View/Open
cvimala_chapter3.pdf		1.19 MB	Adobe PDF	View/Open
cvimala_chapter4.pdf		331.72 kB	Adobe PDF	View/Open
cvimala_intro.pdf		331.83 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET