Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/2356
Title: Multilingual acoustic modeling: a unified approach
Researcher: Santhosh Kumar, C
Guide(s): Haizhou, Li
Mohandas, V P
Keywords: Language identification systems
Communication engineering
Electronics
Multilingual acoustic models
Phonotactic language
Phonotactic LID systems
Upload Date: 23-Aug-2011
University: Amrita Vishwa Vidyapeetham (University)
Completed Date: 2011
Abstract: Spoken language processing technologies have matured enough to be used in commercial applications. Yet, services based on these technologies are not very popular in multilingual societies such as India, in spite of their wide applications. India is a multilingual society with more than 30 languages spoken across the country, and at least three languages spoken in most of the major cities. Further, people tend to mix words across languages, necessitating the use of multilingual solutions. Then, data collection for some of the languages like Tulu, a South Indian language, is extremely difficult if not impossible. People speaking the language are spread across the country, mixing with people speaking other major languages. This makes the language Tulu acoustically very diverse. Further, the total number of people speaking the language is less than a few million. The importance of a language cannot be related to the number of people speaking the language and could be even political at times. A multilingual solution would therefore be particularly attractive. When building multilingual acoustic models, language specific variations in the features increase and acoustic models becomes weak as a result. A possible solution is to capture these variations efficiently, in a way that it does not affect the performance of the system. Another alternative is to use robust features that are less sensitive to languages. Language identification is an integral part of any multilingual spoken language processing system. Language identification can help move the acoustic space of the language independent system to the subspace of the language. Or, it could be used to select the language specific acoustic model from many monolingual systems available. Phone recognition followed by language modeling, phonotactic approach, is one of the popular approaches to language identification for its simplicity and state-of-the-art performance.
Pagination: 127p.
URI: http://hdl.handle.net/10603/2356
Appears in Departments:Department of Electronics & Communication Engineering (Amrita School of Engineering)

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File70.45 kBAdobe PDFView/Open
02_certificate.pdf94.3 kBAdobe PDFView/Open
03_declaration.pdf66 kBAdobe PDFView/Open
04_dedication.pdf58.47 kBAdobe PDFView/Open
05_contents.pdf33.25 kBAdobe PDFView/Open
06_acknowledgement.pdf31.26 kBAdobe PDFView/Open
07_list of figures.pdf34.62 kBAdobe PDFView/Open
08_list of tables.pdf28.53 kBAdobe PDFView/Open
09_list of abbreviations.pdf45.61 kBAdobe PDFView/Open
10_abstract.pdf30.59 kBAdobe PDFView/Open
11_chapter 1.pdf44.18 kBAdobe PDFView/Open
12_chapter 2.pdf447.22 kBAdobe PDFView/Open
13_chapter 3.pdf64.91 kBAdobe PDFView/Open
14_chapter 4.pdf240.8 kBAdobe PDFView/Open
15_chapter 5.pdf97.24 kBAdobe PDFView/Open
16_chapter 6.pdf160.46 kBAdobe PDFView/Open
17_chapter 7.pdf72.26 kBAdobe PDFView/Open
18_appendix.pdf299.85 kBAdobe PDFView/Open
19_bibliography.pdf105.43 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: