Design and implementation of an efficient speaker independent speech recognition system

Uma Maheswari N

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/13796

Title:	Design and implementation of an efficient speaker independent speech recognition system
Researcher:	Uma Maheswari N
Guide(s):	Kapilan, A P
Keywords:	Independent speech recognition system, Indian English, American English, British English Speeches, Efficient Speaker, Hidden Markov Model, Probabilistic Neural Network, Recurrent Neural Network
Upload Date:	9-Dec-2013
University:	Anna University
Completed Date:	2010
Abstract:	While speaker-dependent speech recognition systems have achieved close to 90% accuracy, speaker-independent speech recognition systems perform more poorly. Speech recognition systems used in real-time applications involve complex algorithms for faithful recognition. In this thesis, we describe a Speaker Independent Speech Recognition System for Indian English, American English and British English speech. The procedure is divided into four stages. The first stage deals with the general processing of the speech input. The second stage deals with preprocessing of the input speech and learning of the sound units. The third stage performs phoneme recognition using two-level neural networks: a Probabilistic Neural Network and a Recurrent Neural Network. The fourth stage performs word recognition and text recognition from the string of phonemes using a Hidden Markov Model. The system was first trained on Indian English speech consisting of 300 words uttered by 60 speakers. It was tested live with 10 Indian English speakers and showed a recognition rate of 76.8%, with the higher error rate attributed to ambient noise. It was also tested live on isolated digits from 0 to 9 uttered by 3 speakers and achieved a recognition rate of 98.3%. The system was then trained on American English speech consisting of 250 words uttered by 50 speakers. The test samples comprised 250 words spoken by a different set of 30 speakers. The recognition accuracy was found to be 89.1% on average, which is better than the previous results. Further, the system was trained on British English speech consisting of 200 words uttered by 30 speakers. The test samples comprised 200 words spoken by a different set of 20 speakers. The recognition accuracy was found to be 92.3% on average, which is well above the previous results.
Pagination:	xix, 115
URI:	http://hdl.handle.net/10603/13796
Appears in Departments:	Faculty of Information and Communication Engineering
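
The abstract names a Probabilistic Neural Network as one of the two networks used for phoneme recognition but gives no implementation details. As a rough illustration only, the textbook PNN decision rule (a Gaussian kernel per stored training pattern, averaged per class, with the highest-scoring class chosen) might be sketched as below. The function name pnn_classify, the sigma value, the two-dimensional feature vectors and the phoneme labels are all assumptions made for this sketch and are not taken from the thesis.

```python
import math


def pnn_classify(x, training, sigma=0.5):
    """Return the label whose Parzen-window density estimate at x is largest.

    training: dict mapping label -> list of feature vectors (hypothetical data).
    sigma: Gaussian kernel width, an assumed smoothing parameter.
    """
    scores = {}
    for label, patterns in training.items():
        # Pattern layer: one Gaussian kernel per stored training vector.
        total = 0.0
        for p in patterns:
            sq_dist = sum((a - b) ** 2 for a, b in zip(x, p))
            total += math.exp(-sq_dist / (2.0 * sigma ** 2))
        # Summation layer: average kernel response for this class.
        scores[label] = total / len(patterns)
    # Decision layer: choose the class with the highest averaged response.
    return max(scores, key=scores.get)


# Toy feature vectors for two made-up phoneme labels (not from the thesis).
toy_training = {
    "aa": [(0.0, 0.0), (0.1, 0.2)],
    "iy": [(1.0, 1.0), (0.9, 1.1)],
}

predicted = pnn_classify((0.05, 0.1), toy_training)
```

In a real recognizer the feature vectors would come from the preprocessing stage and the per-frame labels would feed the later stages; how the thesis combines the PNN with the Recurrent Neural Network and the Hidden Markov Model is not stated in this record.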

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	49.63 kB	Adobe PDF
02_certificates.pdf		1 MB	Adobe PDF
03_abstract.pdf		13.12 kB	Adobe PDF
04_acknowledgement.pdf		13.31 kB	Adobe PDF
05_contents.pdf		46.51 kB	Adobe PDF
06_chapter 1.pdf		59.44 kB	Adobe PDF
07_chapter 2.pdf		61.99 kB	Adobe PDF
08_chapter 3.pdf		247.7 kB	Adobe PDF
09_chapter 4.pdf		1.86 MB	Adobe PDF
10_chapter 5.pdf		13.87 kB	Adobe PDF
11_references.pdf		43.52 kB	Adobe PDF
12_publications.pdf		16.66 kB	Adobe PDF
13_vitae.pdf		10.49 kB	Adobe PDF

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Shodhganga : a reservoir of Indian theses @ INFLIBNET