Hybrid Approach Based Lexical and Morphosyntactic Components Modelling for Resolving Pronominal Anaphora in Gujarati Text

Tailor Chetanaben Maheshbhai

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/373529

Full metadata record

DC Field	Value	Language
dc.coverage.spatial
dc.date.accessioned	2022-04-12T05:23:21Z	-
dc.date.available	2022-04-12T05:23:21Z	-
dc.identifier.uri	http://hdl.handle.net/10603/373529	-
dc.description.abstract	The present epoch has witnessed much research and enhancement in the field of NLP. In this digitized era, the major text is in the english language, but only 10% of people in India understand the English language. Many people living in rural communities neither understand nor speak the English language. Therefore, to realize the dream of Digital India, local languages are given more focus now a day which is a dearth need for research to remove the language barrier. newlineThe research work has been started in natural languages to develop different NLP applications such as Machine Translation, Text Summarization, Question Answering systems etc. A significant amount of work has been recorded for foreign languages, but no significant work has been recorded for Pronominal Anaphora Resolution for Gujarati text, even though it contributes in developing NLP applications.So, the objective of this research work is to study the anaphora and find its suitable newlineantecedents automatically in Gujarati text discourse. It requires the pre-processing components such as Sentence tokenizer, POS Tagger, Chunker, and Morphological newlineAnalyzer. newlineA sentence tokenizer, which is not only useful in Anaphora Resolution but also valuable for Text summarization, POS tagger, and Chunker development too, is the newlineinitial step in text processing. Dot, exclamation marks, single quotes, double quotes,question marks, and consecutive multiple occurrences of sentence end markers are considered as sentence end markers. A statistical model, namely Punkt, has been developed using the Gujarati news article corpus. Linguistic rules are designed to newlinehandle the issues such as the abbreviation, the parenthetical expressions, the order list covering different patterns, and quotation marks ambiguity due to different types newlineof quotation marks as well as direct speech sentences. An average accuracy achieved is 99.34% using corpus consisting of the six different article categories, namely newlineBusiness, Crime, Politics, Sports, Technical, and Vaividhya including EMILLE corpus.
dc.format.extent	xxii,182p
dc.language	English
dc.relation
dc.rights	university
dc.title	Hybrid Approach Based Lexical and Morphosyntactic Components Modelling for Resolving Pronominal Anaphora in Gujarati Text
dc.title.alternative
dc.creator.researcher	Tailor Chetanaben Maheshbhai
dc.subject.keyword	anaphora resolution - Gujarati language
dc.subject.keyword	Computer Science
dc.subject.keyword	Natural Language Processing
dc.description.note
dc.contributor.guide	Patel Bankim
dc.publisher.place	Barodli
dc.publisher.university	Uka Tarsadia University
dc.publisher.institution	Faculty of Computer Science
dc.date.registered	2015
dc.date.completed	2022
dc.date.awarded	2022
dc.format.dimensions
dc.format.accompanyingmaterial	CD
dc.source.university	University
dc.type.degree	Ph.D.
Appears in Departments:	Faculty of Computer Science

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	497.01 kB	Adobe PDF	View/Open
02_declaration.pdf		383.25 kB	Adobe PDF	View/Open
03_certificates.pdf		2.07 MB	Adobe PDF	View/Open
04_acknowledgement.pdf		869.61 kB	Adobe PDF	View/Open
05_content.pdf		1.7 MB	Adobe PDF	View/Open
06_preface.pdf		604.25 kB	Adobe PDF	View/Open
07_chapter 1.pdf		1.58 MB	Adobe PDF	View/Open
08_chapter_2.pdf		1.42 MB	Adobe PDF	View/Open
09_chapter_3.pdf		1.19 MB	Adobe PDF	View/Open
10_chapter_4.pdf		1.52 MB	Adobe PDF	View/Open
11_chapter_5.pdf		1.64 MB	Adobe PDF	View/Open
12_chapter_6.pdf		1.45 MB	Adobe PDF	View/Open
13_chapter_7.pdf		1.45 MB	Adobe PDF	View/Open
14_chapter_8.pdf		1.12 MB	Adobe PDF	View/Open
15_chapter_9.pdf		669.35 kB	Adobe PDF	View/Open
16_chapter_10.pdf		855.22 kB	Adobe PDF	View/Open
17_references.pdf		940.28 kB	Adobe PDF	View/Open
18_plagiarism_report.pdf		521.24 kB	Adobe PDF	View/Open
80_recommendation.pdf		1.75 MB	Adobe PDF	View/Open

Show simple item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET