Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/461914
Title: | Information Extraction and Named Entity Recognition for Surgical Data |
Researcher: | RAVIKUMAR J |
Guide(s): | RAMAKANTH KUMAR P |
Keywords: | Computer Science Computer Science Interdisciplinary Applications Engineering and Technology |
University: | Visvesvaraya Technological University, Belagavi |
Completed Date: | 2022 |
Abstract: | In the field of information extraction, based name entity recognition is one of the major challenges for the word processing operation. NER involves the handling of structured and unstructured data and processing them in terms of entities, people (patient s name), diseases, corresponding doctor and other previous history. NER is instinctively simple for human beings, but most of the name entities are the exact times and capitalized letters with respect to English language and which can be easily recognizable in usual language. There will be many problems which are tedious for the recognition process by human beings for handling these ambiguities with list of name which can be an added advantage in human based recognition process. newlineThe thesis is an effective NER system proposed and built with supervised approach in order to provide better result for NER approach as we trust the globally accepted English language as the input medium based on hospital and clinical database. This research study investigate the criteria for the enhanced performance of NER on clinical test case, for the best practices of NER we have selected two major approach such as LSTM(Long Short term Memory) based on Recurrent Neural Network approach. newlineThe major contributions are initially from the clinical data provided and identified with various entities which are required for processing input data into various predefined categories further followed by the division of data into training and testing such a way that 70% and 30% better accuracy is based on the neural network approach, the processing is made based on the calculation of the Term frequency (TF) and Inverse Document frequency (IDF) on a numerical statics on the corpus data. These calculations are done based on the identification of names from the input data, finally named entities are detected from the sentence using machine learning approach. The thesis also provides a hybrid approach which can build a novel NER system developed on python platform. |
Pagination: | |
URI: | http://hdl.handle.net/10603/461914 |
Appears in Departments: | R V College of Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 117.98 kB | Adobe PDF | View/Open |
03_content.pdf | 226.48 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 354.43 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 603.43 kB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 472.29 kB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 904.3 kB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 732.14 kB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 911.64 kB | Adobe PDF | View/Open | |
10_chapter 6.pdf | 1.46 MB | Adobe PDF | View/Open | |
12_annexures.pdf | 465.29 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 266.22 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: