Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/522541
Title: Rule based dependency parser for Telugu
Researcher: Sangeetha, P.
Guide(s): Parameswari, K.
Keywords: Arts and Humanities
Language
Language and Linguisticsn
University: University of Hyderabad
Completed Date: 2022
Abstract: Abstract newlineParsing natural languages has been gaining popularity in recent years and newlineattracted the interest of Natural Language Processing (NLP) researchers around newlinethe world. It is challenging when the language under study is a free-word order newlinelanguage and morphologically rich like Telugu, the south-central Dravidian newlinelanguage. Parsing refers to the process of syntactic analysis of a specific language newlinetext. A parser is an automated tool that dissects sentences to provide newlinesyntactic/syntactico-semantic analysis of relations of words in a sentence. Parsing newlineis useful in the downstream analysis and applications of NLP such as machine newlinetranslation, document classification, dialogue modelling, etc.., newlineThis study adopts a knowledge-driven approach, i.e. a rule-based technique for newlinebuilding parser for Telugu using linguistic cues as rules. The present research newlineadopts the Indian grammatical tradition i.e. P¯an. ini s Grammatical (PG) tradition newlineas the dependency model to parse sentences. A detailed description of mapping newlinesemantic relations to vibhaktis (case suffixes and postpositions) using linguistic newlinecues in Telugu is presented. newlineAn enhanced annotation scheme for Telugu dependency relations is introduced. newlineChallenges faced in parsing ambiguous structures are elaborated alongside newlineproviding enhanced tags to handle them. The study describes the parsing newlinealgorithm and the linguistic knowledge employed while developing the parser. The newlineresearch further provides results, which suggest that enriching the current parser newlinewith linguistic inputs can increase the accuracy and tackle ambiguity better than newlineexisting data-driven methods. Results are encouraging and this parser proves to be newlineefficient for languages like Telugu which can be later extended to other newlinemorphologically-rich languages. newline
Pagination: 140p
URI: http://hdl.handle.net/10603/522541
Appears in Departments:Centre for Applied Linguistics and Translation Studies

Files in This Item:
File Description SizeFormat 
80_recommendation.pdfAttached File821.05 kBAdobe PDFView/Open
abstract.pdf54.32 kBAdobe PDFView/Open
annexures.pdf2.62 MBAdobe PDFView/Open
chapter 1.pdf726.14 kBAdobe PDFView/Open
chapter 2.pdf347.92 kBAdobe PDFView/Open
chapter 3.pdf1.15 MBAdobe PDFView/Open
chapter 4.pdf775.67 kBAdobe PDFView/Open
chapter 5.pdf694.87 kBAdobe PDFView/Open
chapter 6.pdf109.5 kBAdobe PDFView/Open
contents.pdf122.39 kBAdobe PDFView/Open
prelim pages.pdf234.21 kBAdobe PDFView/Open
title.pdf138.29 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: