Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/373529
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.coverage.spatial | ||
dc.date.accessioned | 2022-04-12T05:23:21Z | - |
dc.date.available | 2022-04-12T05:23:21Z | - |
dc.identifier.uri | http://hdl.handle.net/10603/373529 | - |
dc.description.abstract | Recent years have seen much research and progress in the field of NLP. In this digital era most text is in English, but only 10% of people in India understand English, and many people living in rural communities neither understand nor speak it. Therefore, to realize the dream of Digital India, local languages are now receiving more attention, and research to remove the language barrier is urgently needed. Research on natural languages has begun in order to develop NLP applications such as Machine Translation, Text Summarization, and Question Answering systems. A significant amount of work has been reported for foreign languages, but no significant work has been reported on Pronominal Anaphora Resolution for Gujarati text, even though it contributes to the development of NLP applications. The objective of this research is therefore to study anaphora and automatically find their suitable antecedents in Gujarati text discourse. This requires pre-processing components such as a Sentence Tokenizer, POS Tagger, Chunker, and Morphological Analyzer. The sentence tokenizer, which is useful not only for Anaphora Resolution but also for Text Summarization, POS tagger, and Chunker development, is the initial step in text processing. Dots, exclamation marks, single quotes, double quotes, question marks, and consecutive multiple occurrences of these characters are treated as sentence end markers. A statistical Punkt model has been trained on a Gujarati news article corpus. Linguistic rules are designed to handle issues such as abbreviations, parenthetical expressions, ordered lists covering different patterns, and quotation mark ambiguity arising from different types of quotation marks as well as direct speech sentences. An average accuracy of 99.34% is achieved on a corpus consisting of six article categories, namely Business, Crime, Politics, Sports, Technical, and Vaividhya, together with the EMILLE corpus. | |
dc.format.extent | xxii,182p | |
dc.language | English | |
dc.relation | ||
dc.rights | university | |
dc.title | Hybrid Approach Based Lexical and Morphosyntactic Components Modelling for Resolving Pronominal Anaphora in Gujarati Text | |
dc.title.alternative | ||
dc.creator.researcher | Tailor Chetanaben Maheshbhai | |
dc.subject.keyword | anaphora resolution - Gujarati language | |
dc.subject.keyword | Computer Science | |
dc.subject.keyword | Natural Language Processing | |
dc.description.note | ||
dc.contributor.guide | Patel Bankim | |
dc.publisher.place | Bardoli | |
dc.publisher.university | Uka Tarsadia University | |
dc.publisher.institution | Faculty of Computer Science | |
dc.date.registered | 2015 | |
dc.date.completed | 2022 | |
dc.date.awarded | 2022 | |
dc.format.dimensions | ||
dc.format.accompanyingmaterial | CD | |
dc.source.university | University | |
dc.type.degree | Ph.D. | |
Appears in Departments: | Faculty of Computer Science |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 497.01 kB | Adobe PDF | View/Open |
02_declaration.pdf | | 383.25 kB | Adobe PDF | View/Open |
03_certificates.pdf | | 2.07 MB | Adobe PDF | View/Open |
04_acknowledgement.pdf | | 869.61 kB | Adobe PDF | View/Open |
05_content.pdf | | 1.7 MB | Adobe PDF | View/Open |
06_preface.pdf | | 604.25 kB | Adobe PDF | View/Open |
07_chapter 1.pdf | | 1.58 MB | Adobe PDF | View/Open |
08_chapter_2.pdf | | 1.42 MB | Adobe PDF | View/Open |
09_chapter_3.pdf | | 1.19 MB | Adobe PDF | View/Open |
10_chapter_4.pdf | | 1.52 MB | Adobe PDF | View/Open |
11_chapter_5.pdf | | 1.64 MB | Adobe PDF | View/Open |
12_chapter_6.pdf | | 1.45 MB | Adobe PDF | View/Open |
13_chapter_7.pdf | | 1.45 MB | Adobe PDF | View/Open |
14_chapter_8.pdf | | 1.12 MB | Adobe PDF | View/Open |
15_chapter_9.pdf | | 669.35 kB | Adobe PDF | View/Open |
16_chapter_10.pdf | | 855.22 kB | Adobe PDF | View/Open |
17_references.pdf | | 940.28 kB | Adobe PDF | View/Open |
18_plagiarism_report.pdf | | 521.24 kB | Adobe PDF | View/Open |
80_recommendation.pdf | | 1.75 MB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).