Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/2356
Title: | Multilingual acoustic modeling: a unified approach |
Researcher: | Santhosh Kumar, C |
Guide(s): | Haizhou, Li Mohandas, V P |
Keywords: | Language identification systems Communication engineering Electronics Multilingual acoustic models Phonotactic language Phonotactic LID systems |
Upload Date: | 23-Aug-2011 |
University: | Amrita Vishwa Vidyapeetham (University) |
Completed Date: | 2011 |
Abstract: | Spoken language processing technologies have matured enough to be used in commercial applications. Yet, services based on these technologies are not very popular in multilingual societies such as India, in spite of their wide applications. India is a multilingual society with more than 30 languages spoken across the country, and at least three languages spoken in most of the major cities. Further, people tend to mix words across languages, necessitating the use of multilingual solutions. Then, data collection for some of the languages like Tulu, a South Indian language, is extremely difficult if not impossible. People speaking the language are spread across the country, mixing with people speaking other major languages. This makes the language Tulu acoustically very diverse. Further, the total number of people speaking the language is less than a few million. The importance of a language cannot be related to the number of people speaking the language and could be even political at times. A multilingual solution would therefore be particularly attractive. When building multilingual acoustic models, language specific variations in the features increase and acoustic models becomes weak as a result. A possible solution is to capture these variations efficiently, in a way that it does not affect the performance of the system. Another alternative is to use robust features that are less sensitive to languages. Language identification is an integral part of any multilingual spoken language processing system. Language identification can help move the acoustic space of the language independent system to the subspace of the language. Or, it could be used to select the language specific acoustic model from many monolingual systems available. Phone recognition followed by language modeling, phonotactic approach, is one of the popular approaches to language identification for its simplicity and state-of-the-art performance. |
Pagination: | 127p. |
URI: | http://hdl.handle.net/10603/2356 |
Appears in Departments: | Department of Electronics & Communication Engineering (Amrita School of Engineering) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 70.45 kB | Adobe PDF | View/Open |
02_certificate.pdf | 94.3 kB | Adobe PDF | View/Open | |
03_declaration.pdf | 66 kB | Adobe PDF | View/Open | |
04_dedication.pdf | 58.47 kB | Adobe PDF | View/Open | |
05_contents.pdf | 33.25 kB | Adobe PDF | View/Open | |
06_acknowledgement.pdf | 31.26 kB | Adobe PDF | View/Open | |
07_list of figures.pdf | 34.62 kB | Adobe PDF | View/Open | |
08_list of tables.pdf | 28.53 kB | Adobe PDF | View/Open | |
09_list of abbreviations.pdf | 45.61 kB | Adobe PDF | View/Open | |
10_abstract.pdf | 30.59 kB | Adobe PDF | View/Open | |
11_chapter 1.pdf | 44.18 kB | Adobe PDF | View/Open | |
12_chapter 2.pdf | 447.22 kB | Adobe PDF | View/Open | |
13_chapter 3.pdf | 64.91 kB | Adobe PDF | View/Open | |
14_chapter 4.pdf | 240.8 kB | Adobe PDF | View/Open | |
15_chapter 5.pdf | 97.24 kB | Adobe PDF | View/Open | |
16_chapter 6.pdf | 160.46 kB | Adobe PDF | View/Open | |
17_chapter 7.pdf | 72.26 kB | Adobe PDF | View/Open | |
18_appendix.pdf | 299.85 kB | Adobe PDF | View/Open | |
19_bibliography.pdf | 105.43 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: