Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/602728
Title: | Semi Supervised Agriculture Information Extraction and Knowledge Graph Creation Model using Weighted Distributional Semantics Syntactic Dependencies |
Researcher: | Veena G |
Guide(s): | Deepa Gupta |
Keywords: | Compurint;Natural Language; Information Extraction; information extraction tools; natural language processing :NLP: text analytics; healthcare ; data integration and NLP; Information retrieval; data mining; knowledge management; Electronic Health Records; agricultural sector; soil; crop cultivation; animal husbandry; forestry; AGROVOC;Red Soil; Knowledge graph; Engineering Engineering and Technology |
University: | Amrita Vishwa Vidyapeetham University |
Completed Date: | 2023 |
Abstract: | Information Extraction (IE) is a field of Natural Language Processing (NLP) that involves newlineautomatically extracting useful information from text data. Developing accurate information newlineextraction models requires overcoming several challenges, domain specific vocabulary, data newlineintegration challenges, dynamic data, and the need for domain expertise. It involves newlineidentifying and extracting specific pieces of information, such as entities, relationships, newlineevents, and concepts, from unstructured or semistructured data, such as text documents, newlineweb pages, or social media posts. It has applications in industries such as finance, healthcare, newlineagriculture, and social media analysis. IE systems and Knowledge Graphs (KGs) are newlineinterconnected because the former is used to extract information from unstructured data, newlinewhile the latter stores this information in an organized and easily accessible manner. newlineThe knowledge graph facilitates easy querying and analysis of information. The knowledge newlinegraph can be accessed by users in an organization to gain insights, make decisions, and newlineautomate processes. For the past few decades, there has been significant research activity in newlinethe field of automatic knowledge graph creation. Among the various tasks involved, triplet newlineextraction, which involves identifying entities and their relationships, has proven to be newlineparticularly challenging. Supervised approaches demand an extensive corpus of annotated newlinetraining data, comprising entities and relationships. This training data is employed to newlinetrain a classifier, which in turn, is utilized to extract relationships from the test data. newlineThough supervised models outperform unsupervised models, they are constrained by the newlineneed of labelled data for the triplet extraction task. In our study, the main focus is on newlineautomatically extracting triplets from agricultural text documents and constructing an newlineagricultural knowledge graph.A major contribution of this thesis is an unsupervised weighted distributional semantics approach for entity labeling in the agricultural... |
Pagination: | xv, 154 |
URI: | http://hdl.handle.net/10603/602728 |
Appears in Departments: | Amrita School of Computing |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 740.27 kB | Adobe PDF | View/Open |
02_prelim pages.pdf | 802.47 kB | Adobe PDF | View/Open | |
03_certificate of plagiarism.pdf | 395.82 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 57.03 kB | Adobe PDF | View/Open | |
05_contents.pdf | 70.48 kB | Adobe PDF | View/Open | |
06_chapter 1.pdf | 424.13 kB | Adobe PDF | View/Open | |
07_chapter 2.pdf | 147.82 kB | Adobe PDF | View/Open | |
08_chapter 3.pdf | 1.47 MB | Adobe PDF | View/Open | |
09_chapter 4.pdf | 827.59 kB | Adobe PDF | View/Open | |
10_chapter 5.pdf | 7.56 MB | Adobe PDF | View/Open | |
11_chapter 6.pdf | 34.83 kB | Adobe PDF | View/Open | |
12_annexure.pdf | 121.93 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 743.59 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: