Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/476964
Title: Development and evaluation of hybrid machine translation systems for english to indian language under low resource conditions
Researcher: Mrinalini Kannan
Guide(s): Vijayalakshmi P
Keywords: Bilingual Language
Machine Translation
Parts-of-Speech
University: Anna University
Completed Date: 2022
Abstract: India is a multi-cultural and multilingual country with 22 official newlinelanguages belonging to different linguistic families. Since the time of British newlinecolonial rule in India, English has been used as the linguistic medium (L1 newlinelanguage) for administrative and higher education purposes. Postindependence, newlinethe use of regional languages (as L1 language) along with newlineEnglish (as L2 language) has been encouraged in the states across the country. newlineHowever, the usage of either L1 or L2 language varies among the common newlinepeople. Thus, it is essential to develop machine translation (MT) systems newlinefrom English-to-Indian languages for smoother transactions and newlinecommunication across the country. Among the seven linguistic families of newlineSouth Asia, Indo-Aryan and Dravidian languages account for over 90% of newlineIndian speakers. On this note, the current research work proposes to develop newlinean efficient statistical-based (SMT) and neural-based (NMT) machine newlinetranslation systems for translation from English to two Indian languages newlinenamely, Tamil (a Dravidian language) and Hindi (an Indo-Aryan language). newlineMost of the well-established data-driven approaches for developing newlineMT systems require huge amount of parallel text in the source and target newlinelanguage, to train an efficient translation model. Availability of such huge newlineparallel corpora between English and Indian languages is scarce. Further, newlinedomain-specific parallel corpora required to develop highly efficient and newlineapplication-oriented MT systems are also not available. The proposed work newlinemakes use of punctuation marks and re-ordering to augment the parallel data newlineavailable for training the SMT and NMT systems. newline
Pagination: xix,183p.
URI: http://hdl.handle.net/10603/476964
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File22.31 kBAdobe PDFView/Open
02_prelim pages.pdf2.32 MBAdobe PDFView/Open
03_contents.pdf126.87 kBAdobe PDFView/Open
04_abstracts.pdf84.37 kBAdobe PDFView/Open
05_chapter1.pdf385.33 kBAdobe PDFView/Open
06_chapter2.pdf429.65 kBAdobe PDFView/Open
07_chapter3.pdf2.27 MBAdobe PDFView/Open
08_chapter4.pdf1.45 MBAdobe PDFView/Open
09_chapter5.pdf563.12 kBAdobe PDFView/Open
10_annexures.pdf111.21 kBAdobe PDFView/Open
80_recommendation.pdf95.19 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: