Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/522541
Title: | Rule based dependency parser for Telugu |
Researcher: | Sangeetha, P. |
Guide(s): | Parameswari, K. |
Keywords: | Arts and Humanities Language Language and Linguisticsn |
University: | University of Hyderabad |
Completed Date: | 2022 |
Abstract: | Abstract newlineParsing natural languages has been gaining popularity in recent years and newlineattracted the interest of Natural Language Processing (NLP) researchers around newlinethe world. It is challenging when the language under study is a free-word order newlinelanguage and morphologically rich like Telugu, the south-central Dravidian newlinelanguage. Parsing refers to the process of syntactic analysis of a specific language newlinetext. A parser is an automated tool that dissects sentences to provide newlinesyntactic/syntactico-semantic analysis of relations of words in a sentence. Parsing newlineis useful in the downstream analysis and applications of NLP such as machine newlinetranslation, document classification, dialogue modelling, etc.., newlineThis study adopts a knowledge-driven approach, i.e. a rule-based technique for newlinebuilding parser for Telugu using linguistic cues as rules. The present research newlineadopts the Indian grammatical tradition i.e. P¯an. ini s Grammatical (PG) tradition newlineas the dependency model to parse sentences. A detailed description of mapping newlinesemantic relations to vibhaktis (case suffixes and postpositions) using linguistic newlinecues in Telugu is presented. newlineAn enhanced annotation scheme for Telugu dependency relations is introduced. newlineChallenges faced in parsing ambiguous structures are elaborated alongside newlineproviding enhanced tags to handle them. The study describes the parsing newlinealgorithm and the linguistic knowledge employed while developing the parser. The newlineresearch further provides results, which suggest that enriching the current parser newlinewith linguistic inputs can increase the accuracy and tackle ambiguity better than newlineexisting data-driven methods. Results are encouraging and this parser proves to be newlineefficient for languages like Telugu which can be later extended to other newlinemorphologically-rich languages. newline |
Pagination: | 140p |
URI: | http://hdl.handle.net/10603/522541 |
Appears in Departments: | Centre for Applied Linguistics and Translation Studies |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
80_recommendation.pdf | Attached File | 821.05 kB | Adobe PDF | View/Open |
abstract.pdf | 54.32 kB | Adobe PDF | View/Open | |
annexures.pdf | 2.62 MB | Adobe PDF | View/Open | |
chapter 1.pdf | 726.14 kB | Adobe PDF | View/Open | |
chapter 2.pdf | 347.92 kB | Adobe PDF | View/Open | |
chapter 3.pdf | 1.15 MB | Adobe PDF | View/Open | |
chapter 4.pdf | 775.67 kB | Adobe PDF | View/Open | |
chapter 5.pdf | 694.87 kB | Adobe PDF | View/Open | |
chapter 6.pdf | 109.5 kB | Adobe PDF | View/Open | |
contents.pdf | 122.39 kB | Adobe PDF | View/Open | |
prelim pages.pdf | 234.21 kB | Adobe PDF | View/Open | |
title.pdf | 138.29 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: