Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/457910
Title: | Issues and Challenges for Machine Translation of Gujarati to English Language |
Researcher: | Patel,Chirag D |
Guide(s): | Ahalpara,Dilip P |
Keywords: | Artificial Intelligence Computer Science Computer Science Artificial Intelligence Engineering and Technology Machine Translation Natural Language Processing |
University: | Dharmsinh Desai University |
Completed Date: | 2019 |
Abstract: | We propose an algorithmic approach to build a Machine Translation System for Gujarati language, by developing an adequate software support along with manually created Dictionaries/ Corpuses. Focus has been made on building a Paninian Grammar based partial Machine Translation system, where we have used language specific rules, classification algorithms and manually prepared corpuses for statistical analysis wherever required. This approach is based on a layered structure to fulfil a complete Machine Translation of Natural Language from Gujarati to English and vice versa. At each layer, the necessary and sufficient information, namely the word, phrase and sentence level information, is extracted from the input using the software. The software uses manually tagged Corpuses as learning databases. Keeping the primary focus on the translation phase, a given sentence at the source language is translated to target language using the word, phrase and sentence level information collected during the previous phases. Noteworthy results have been attained for different phases of Machine Translation, such as POS (Part Of Speech) Tagger with 96.46% accuracy, Shallow Parser (Chunker) with 96.22% accuracy and Morph Generator/Analyzer with 100% accuracy. Out of various categories of words, such as Noun, Adjective, Verb, Adverb etc, the Morph Analyzer has been built only for Noun words. This essentially helps prove the efficiency of the algorithm. An algorithm for the final phase, i.e. the Parser, has been shown to be effective for simple sentences as is highlighted later in the thesis. The implementation of the algorithm requires manually created verb frames and a full working Morph Analyzer and hence has been excluded in the present work. Notably, the present work is believed to be first of its kind for Machine Translation for Gujarati language. newline newline |
Pagination: | 188 |
URI: | http://hdl.handle.net/10603/457910 |
Appears in Departments: | Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 58.41 kB | Adobe PDF | View/Open |
02_prelim pages.pdf | 95.55 kB | Adobe PDF | View/Open | |
03_content.pdf | 38.4 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 36.23 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 799.09 kB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 531 kB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 111.61 kB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 1.18 MB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 591.45 kB | Adobe PDF | View/Open | |
10_chapter 6.pdf | 136.25 kB | Adobe PDF | View/Open | |
12_chapter 8.pdf | 673.17 kB | Adobe PDF | View/Open | |
13_chapter 9.pdf | 528.33 kB | Adobe PDF | View/Open | |
14_annexures.pdf | 75 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 99.88 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: