Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/183865
Title: Hindi Speech Recognition Using Hidden Markov Model Tool Kit
Researcher: Sharmila
Guide(s): Awasthy Neeta
Keywords: ASR,HMM,HTK,LDA,MFCC,PLP,LPC
University: Uttarakhand Technical University
Completed Date: 8-8-2016
Abstract: Speech is most effective initial communication medium. It is also relates to production and perception. Speech is a medium of transferring information emotions and thoughts. Moreover it is also a unique medium of passing human intellect from one member to another. In science the human voice has long been an accepted feature.The purpose of this final Doctor of Philosophy degree research was to develop a Hindi speech recognition tool using HTK and to make the technology more accessible. The development includes an extensive study of Hidden Markov Model which is presently stated as the most popular tool in the field of speech recognition. The use of Hidden Markov Model Tool Kit HTK for speech recognition has become most predominant in the last several years as evidence by the number of published research papers at major conferences on speech. Database of isolated digits is prepared in clean and noisy environment. This research for acoustic features is explained with full description on Hindi speech recognition using HTK for isolated digits. In first phase of the research it is observed that Revised Perceptual Linear Prediction RPLP is giving best recognition efficiency amongst most of the conventional methods. In the past research pertaining to speech recognition was limited to the acoustic component of speech. Soundbased research has had many applications. This audiobased research gave rise to many landmark innovations.So there is in need to get Audio Visual features for better results of recognition. Audiovisual features play a very important role in ASR systems in presence of noise. In this thesis Hindi speech recognition system is designed using audio visual features also. In real life this speech recognition technology is very useful in traffic security or facilitates functional disability for people. This research work is provided better results for both Audio and Visual. newline newline newline newline newline newline
Pagination: 171 pages
URI: http://hdl.handle.net/10603/183865
Appears in Departments:Department of Electronics and Communication Engineering

Files in This Item:
File Description SizeFormat 
01- title page.pdfAttached File85.44 kBAdobe PDFView/Open
02- certificate.pdf234.84 kBAdobe PDFView/Open
03- contents.pdf65.64 kBAdobe PDFView/Open
04- list of tables.pdf110.87 kBAdobe PDFView/Open
05-list of figures.pdf77.19 kBAdobe PDFView/Open
06-list of abbreviation.pdf63.61 kBAdobe PDFView/Open
07-chapter 1.pdf243.77 kBAdobe PDFView/Open
08-chapter 2.pdf622.08 kBAdobe PDFView/Open
09-chapter 3.pdf747.47 kBAdobe PDFView/Open
10-chapter 4.pdf1.16 MBAdobe PDFView/Open
11-chapter 5.pdf371.62 kBAdobe PDFView/Open
12-chapter 6.pdf73.99 kBAdobe PDFView/Open
13-references.pdf133.31 kBAdobe PDFView/Open
14-appendices.pdf256.09 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: