Rule based dependency parser for Telugu

Sangeetha, P.

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/522541

Title:	Rule based dependency parser for Telugu
Researcher:	Sangeetha, P.
Guide(s):	Parameswari, K.
Keywords:	Arts and Humanities Language Language and Linguisticsn
University:	University of Hyderabad
Completed Date:	2022
Abstract:	Abstract newlineParsing natural languages has been gaining popularity in recent years and newlineattracted the interest of Natural Language Processing (NLP) researchers around newlinethe world. It is challenging when the language under study is a free-word order newlinelanguage and morphologically rich like Telugu, the south-central Dravidian newlinelanguage. Parsing refers to the process of syntactic analysis of a specific language newlinetext. A parser is an automated tool that dissects sentences to provide newlinesyntactic/syntactico-semantic analysis of relations of words in a sentence. Parsing newlineis useful in the downstream analysis and applications of NLP such as machine newlinetranslation, document classification, dialogue modelling, etc.., newlineThis study adopts a knowledge-driven approach, i.e. a rule-based technique for newlinebuilding parser for Telugu using linguistic cues as rules. The present research newlineadopts the Indian grammatical tradition i.e. P¯an. ini s Grammatical (PG) tradition newlineas the dependency model to parse sentences. A detailed description of mapping newlinesemantic relations to vibhaktis (case suffixes and postpositions) using linguistic newlinecues in Telugu is presented. newlineAn enhanced annotation scheme for Telugu dependency relations is introduced. newlineChallenges faced in parsing ambiguous structures are elaborated alongside newlineproviding enhanced tags to handle them. The study describes the parsing newlinealgorithm and the linguistic knowledge employed while developing the parser. The newlineresearch further provides results, which suggest that enriching the current parser newlinewith linguistic inputs can increase the accuracy and tackle ambiguity better than newlineexisting data-driven methods. Results are encouraging and this parser proves to be newlineefficient for languages like Telugu which can be later extended to other newlinemorphologically-rich languages. newline
Pagination:	140p
URI:	http://hdl.handle.net/10603/522541
Appears in Departments:	Centre for Applied Linguistics and Translation Studies

Files in This Item:

File	Description	Size	Format
80_recommendation.pdf	Attached File	821.05 kB	Adobe PDF	View/Open
abstract.pdf		54.32 kB	Adobe PDF	View/Open
annexures.pdf		2.62 MB	Adobe PDF	View/Open
chapter 1.pdf		726.14 kB	Adobe PDF	View/Open
chapter 2.pdf		347.92 kB	Adobe PDF	View/Open
chapter 3.pdf		1.15 MB	Adobe PDF	View/Open
chapter 4.pdf		775.67 kB	Adobe PDF	View/Open
chapter 5.pdf		694.87 kB	Adobe PDF	View/Open
chapter 6.pdf		109.5 kB	Adobe PDF	View/Open
contents.pdf		122.39 kB	Adobe PDF	View/Open
prelim pages.pdf		234.21 kB	Adobe PDF	View/Open
title.pdf		138.29 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET