A Paninian perspective to information dynamics in language mapping structures between English and Hindi

Sukhada

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/156348

Title:	A Paninian perspective to information dynamics in language mapping structures between English and Hindi
Researcher:	Sukhada
Guide(s):	Dipti Misra Sharma
University:	International Institute of Information Technology, Hyderabad
Completed Date:	14/06/2017
Abstract:	In the era of globalization, vast amount of information is available on World Wide Web. In order to access this information, the speakers of different languages have to rely on translation, hence the need of Machine Translation (MT) arises. newline newlinePaninian Grammar (PG) universally admired for its insightful analysis of Sanskrit offers a perspective to language analysis. The main focus of PG is to explain how, where and how much information a language encodes which helps in capturing information dynamics in MT. In order to use Paninian theory to analyse other languages, we model these languages in terms of Paninian primitives such as sup, tin, pada, samasta-pada, etc. In our research, we have used these concepts to analyse English from Paninian perspective and then map English structures into Hindi for handling following major linguistic tasks: newline newline1. Identifying the Meaningful Linguistic Units That Can Participate in a Sentence: This step theoretically accounts English sentence in terms of Paninian primitives such as sup, tiand#729;n, pada and samasta-pada. newline2. Mapping Dependency Parsers Output into a Uniform Schema: Bringing parsers output into one uniform notation provides the system an ability to plug-in any of these parsers without modifying other sub-sequent modules thereby saving large amount of manual work, cost and time. newline3. Framing Transfer Grammar Rules: Word ordering plays an important role in the translation process between languages. Using the concept of pada and samasta-pada the source language sentence is manipulated into target language sentence for natural translation. newline newlineThe main focus of this study is on MT between two diverse languages English and Hindi, But the research carried out in this thesis is generic enough to be applied to any English-Indian Language pair. Because most of the Indian languages have many common features and the transparency of the tools built during this research allows to cover diverse cases easily.
Pagination:
URI:	http://hdl.handle.net/10603/156348
Appears in Departments:	Computational Linguistics

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	82.05 kB	Adobe PDF	View/Open
02_certificate.pdf		21.45 kB	Adobe PDF	View/Open
03_acknowlegdements.pdf		35.64 kB	Adobe PDF	View/Open
04_contents.pdf		82.85 kB	Adobe PDF	View/Open
05_preface.pdf		67.06 kB	Adobe PDF	View/Open
06_list-of-tables-figures.pdf		71.64 kB	Adobe PDF	View/Open
07_chapter 1.pdf		91.3 kB	Adobe PDF	View/Open
08_chapter 2.pdf		113.56 kB	Adobe PDF	View/Open
09_chapter 3.pdf		410.96 kB	Adobe PDF	View/Open
10_chapter 4.pdf		330.4 kB	Adobe PDF	View/Open
11_chapter 5.pdf		170.89 kB	Adobe PDF	View/Open
12_chapter 6.pdf		137.71 kB	Adobe PDF	View/Open
13_chapter 7.pdf		77.06 kB	Adobe PDF	View/Open
14_appendix a&b.pdf		103.54 kB	Adobe PDF	View/Open
15_references.pdf		98.77 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET