Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/156348
Title: | A Paninian perspective to information dynamics in language mapping structures between English and Hindi |
Researcher: | Sukhada |
Guide(s): | Dipti Misra Sharma |
University: | International Institute of Information Technology, Hyderabad |
Completed Date: | 14/06/2017 |
Abstract: | In the era of globalization, vast amount of information is available on World Wide Web. In order to access this information, the speakers of different languages have to rely on translation, hence the need of Machine Translation (MT) arises. newline newlinePaninian Grammar (PG) universally admired for its insightful analysis of Sanskrit offers a perspective to language analysis. The main focus of PG is to explain how, where and how much information a language encodes which helps in capturing information dynamics in MT. In order to use Paninian theory to analyse other languages, we model these languages in terms of Paninian primitives such as sup, tin, pada, samasta-pada, etc. In our research, we have used these concepts to analyse English from Paninian perspective and then map English structures into Hindi for handling following major linguistic tasks: newline newline1. Identifying the Meaningful Linguistic Units That Can Participate in a Sentence: This step theoretically accounts English sentence in terms of Paninian primitives such as sup, tiand#729;n, pada and samasta-pada. newline2. Mapping Dependency Parsers Output into a Uniform Schema: Bringing parsers output into one uniform notation provides the system an ability to plug-in any of these parsers without modifying other sub-sequent modules thereby saving large amount of manual work, cost and time. newline3. Framing Transfer Grammar Rules: Word ordering plays an important role in the translation process between languages. Using the concept of pada and samasta-pada the source language sentence is manipulated into target language sentence for natural translation. newline newlineThe main focus of this study is on MT between two diverse languages English and Hindi, But the research carried out in this thesis is generic enough to be applied to any English-Indian Language pair. Because most of the Indian languages have many common features and the transparency of the tools built during this research allows to cover diverse cases easily. |
Pagination: | |
URI: | http://hdl.handle.net/10603/156348 |
Appears in Departments: | Computational Linguistics |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 82.05 kB | Adobe PDF | View/Open |
02_certificate.pdf | 21.45 kB | Adobe PDF | View/Open | |
03_acknowlegdements.pdf | 35.64 kB | Adobe PDF | View/Open | |
04_contents.pdf | 82.85 kB | Adobe PDF | View/Open | |
05_preface.pdf | 67.06 kB | Adobe PDF | View/Open | |
06_list-of-tables-figures.pdf | 71.64 kB | Adobe PDF | View/Open | |
07_chapter 1.pdf | 91.3 kB | Adobe PDF | View/Open | |
08_chapter 2.pdf | 113.56 kB | Adobe PDF | View/Open | |
09_chapter 3.pdf | 410.96 kB | Adobe PDF | View/Open | |
10_chapter 4.pdf | 330.4 kB | Adobe PDF | View/Open | |
11_chapter 5.pdf | 170.89 kB | Adobe PDF | View/Open | |
12_chapter 6.pdf | 137.71 kB | Adobe PDF | View/Open | |
13_chapter 7.pdf | 77.06 kB | Adobe PDF | View/Open | |
14_appendix a&b.pdf | 103.54 kB | Adobe PDF | View/Open | |
15_references.pdf | 98.77 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: