Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/602728
Title: Semi Supervised Agriculture Information Extraction and Knowledge Graph Creation Model using Weighted Distributional Semantics Syntactic Dependencies
Researcher: Veena G
Guide(s): Deepa Gupta
Keywords: Compurint;Natural Language; Information Extraction; information extraction tools; natural language processing :NLP: text analytics; healthcare ; data integration and NLP; Information retrieval; data mining; knowledge management; Electronic Health Records; agricultural sector; soil; crop cultivation; animal husbandry; forestry; AGROVOC;Red Soil; Knowledge graph;
Engineering
Engineering and Technology
University: Amrita Vishwa Vidyapeetham University
Completed Date: 2023
Abstract: Information Extraction (IE) is a field of Natural Language Processing (NLP) that involves newlineautomatically extracting useful information from text data. Developing accurate information newlineextraction models requires overcoming several challenges, domain specific vocabulary, data newlineintegration challenges, dynamic data, and the need for domain expertise. It involves newlineidentifying and extracting specific pieces of information, such as entities, relationships, newlineevents, and concepts, from unstructured or semistructured data, such as text documents, newlineweb pages, or social media posts. It has applications in industries such as finance, healthcare, newlineagriculture, and social media analysis. IE systems and Knowledge Graphs (KGs) are newlineinterconnected because the former is used to extract information from unstructured data, newlinewhile the latter stores this information in an organized and easily accessible manner. newlineThe knowledge graph facilitates easy querying and analysis of information. The knowledge newlinegraph can be accessed by users in an organization to gain insights, make decisions, and newlineautomate processes. For the past few decades, there has been significant research activity in newlinethe field of automatic knowledge graph creation. Among the various tasks involved, triplet newlineextraction, which involves identifying entities and their relationships, has proven to be newlineparticularly challenging. Supervised approaches demand an extensive corpus of annotated newlinetraining data, comprising entities and relationships. This training data is employed to newlinetrain a classifier, which in turn, is utilized to extract relationships from the test data. newlineThough supervised models outperform unsupervised models, they are constrained by the newlineneed of labelled data for the triplet extraction task. In our study, the main focus is on newlineautomatically extracting triplets from agricultural text documents and constructing an newlineagricultural knowledge graph.A major contribution of this thesis is an unsupervised weighted distributional semantics approach for entity labeling in the agricultural...
Pagination: xv, 154
URI: http://hdl.handle.net/10603/602728
Appears in Departments:Amrita School of Computing

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File740.27 kBAdobe PDFView/Open
02_prelim pages.pdf802.47 kBAdobe PDFView/Open
03_certificate of plagiarism.pdf395.82 kBAdobe PDFView/Open
04_abstract.pdf57.03 kBAdobe PDFView/Open
05_contents.pdf70.48 kBAdobe PDFView/Open
06_chapter 1.pdf424.13 kBAdobe PDFView/Open
07_chapter 2.pdf147.82 kBAdobe PDFView/Open
08_chapter 3.pdf1.47 MBAdobe PDFView/Open
09_chapter 4.pdf827.59 kBAdobe PDFView/Open
10_chapter 5.pdf7.56 MBAdobe PDFView/Open
11_chapter 6.pdf34.83 kBAdobe PDFView/Open
12_annexure.pdf121.93 kBAdobe PDFView/Open
80_recommendation.pdf743.59 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: