Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/218574
Full metadata record
DC FieldValueLanguage
dc.coverage.spatial
dc.date.accessioned2018-10-16T04:57:59Z-
dc.date.available2018-10-16T04:57:59Z-
dc.identifier.urihttp://hdl.handle.net/10603/218574-
dc.description.abstractWith many advances made in automatic speech recognition technology over past few newlinedecades, there is now an increasing demand of developing Indian ASR. There is a huge newlinegap between performance of machines and a human due to lack of resources, complexity newlinein handling feature vectors, decorrelation of feature information and robustness beside newlineever increasing changes in input speech conditions. Different approaches have been newlineexamined to tackle these factors. The aim of the proposed research work is to cope with newlinethese issues through refinement, combination, and integration of front and back end newlineapproaches with different methodologies. newlineOne of the solution to overcome thesis issues is to explore optimization techniques for newlinetraining (GA+HMM, DE+HMM) of large corpora. The proposed method is applied on newlinebaseline acoustic modeling approaches in training stage. We use the stratergies to newlinedevelop them for most frequently used and language resources levied. Punjabi language newlinefalls in this category but for that we need to first build Punjabi speech dataset. So, in this newlinethesis we first build Punjabi speech corpora of isolated and continuous sentences spoken newlineby adult Punjabi speakers. Its performance is not suggested to be productive on large newlinecorpus with traditional approaches at front and back ends of the system. To reduce newlinefeature vector complexity in training stage, Mel Frequency Cepstral Coefficient (MFCC) newlinefeature vectors are combined with optimization algorithms. It refines the processed newlinefeature vectors before performing classification using baseline hidden Markov model newline(HMM) approach. The experiments are then conducted on large vocabulary of Punjabi newlineisolated words. newlineDespite the improvement in performance, a large gap exists due to the mismatch newlinebetween train and test conditions. We try to reduce this gap by using different newlinecombinations of front end approaches (MFCC, Perceptual Linear Prediction (PLP), newlineRelative Spectral Transform (RASTA) - PLP or their fusion). We test them on each newlinerefined modeling approaches through baseline or
dc.format.extent
dc.languageEnglish
dc.relation
dc.rightsuniversity
dc.titleACOUSTIC FEATURES OPTIMIZATION FOR PUNJABI AUTOMATIC SPEECH RECOGNITION SYSTEM
dc.title.alternative
dc.creator.researcherVirender Kadyan
dc.subject.keywordAcoustic Feature Refinement
dc.subject.keywordGFCC
dc.subject.keywordGMM
dc.subject.keywordHMM
dc.subject.keywordMFCC
dc.subject.keywordMF-PLP
dc.subject.keywordMultitaper
dc.subject.keywordPCA
dc.subject.keywordPLP
dc.subject.keywordPunjabi Speech Recognition
dc.description.note
dc.contributor.guideArchana Mantri
dc.publisher.placeChandigarh
dc.publisher.universityChitkara University, Punjab
dc.publisher.institutionFaculty of Computer Science
dc.date.registered15/05/2015
dc.date.completed09/07/2018
dc.date.awarded2018
dc.format.dimensions
dc.format.accompanyingmaterialDVD
dc.source.universityUniversity
dc.type.degreePh.D.
Appears in Departments:Faculty of Computer Science

Files in This Item:
File Description SizeFormat 
abstract.pdfAttached File181.34 kBAdobe PDFView/Open
bibliography (2).pdf371.01 kBAdobe PDFView/Open
chapter1.pdf738.35 kBAdobe PDFView/Open
chapter2.pdf551.75 kBAdobe PDFView/Open
chapter3.pdf1.44 MBAdobe PDFView/Open
chapter4.pdf736.94 kBAdobe PDFView/Open
chapter5.pdf918.83 kBAdobe PDFView/Open
chapter6.pdf640.51 kBAdobe PDFView/Open
chapter7 (1).pdf198.01 kBAdobe PDFView/Open
contents.pdf189.88 kBAdobe PDFView/Open
front_pages.pdf580.32 kBAdobe PDFView/Open
list_of_publications.pdf175.86 kBAdobe PDFView/Open
thesis_main_page_jmc.pdf182.99 kBAdobe PDFView/Open


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: