Information Extraction and Named Entity Recognition for Surgical Data

RAVIKUMAR J

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/461914

Title:	Information Extraction and Named Entity Recognition for Surgical Data
Researcher:	RAVIKUMAR J
Guide(s):	RAMAKANTH KUMAR P
Keywords:	Computer Science Computer Science Interdisciplinary Applications Engineering and Technology
University:	Visvesvaraya Technological University, Belagavi
Completed Date:	2022
Abstract:	In the field of information extraction, based name entity recognition is one of the major challenges for the word processing operation. NER involves the handling of structured and unstructured data and processing them in terms of entities, people (patient s name), diseases, corresponding doctor and other previous history. NER is instinctively simple for human beings, but most of the name entities are the exact times and capitalized letters with respect to English language and which can be easily recognizable in usual language. There will be many problems which are tedious for the recognition process by human beings for handling these ambiguities with list of name which can be an added advantage in human based recognition process. newlineThe thesis is an effective NER system proposed and built with supervised approach in order to provide better result for NER approach as we trust the globally accepted English language as the input medium based on hospital and clinical database. This research study investigate the criteria for the enhanced performance of NER on clinical test case, for the best practices of NER we have selected two major approach such as LSTM(Long Short term Memory) based on Recurrent Neural Network approach. newlineThe major contributions are initially from the clinical data provided and identified with various entities which are required for processing input data into various predefined categories further followed by the division of data into training and testing such a way that 70% and 30% better accuracy is based on the neural network approach, the processing is made based on the calculation of the Term frequency (TF) and Inverse Document frequency (IDF) on a numerical statics on the corpus data. These calculations are done based on the identification of names from the input data, finally named entities are detected from the sentence using machine learning approach. The thesis also provides a hybrid approach which can build a novel NER system developed on python platform.
Pagination:
URI:	http://hdl.handle.net/10603/461914
Appears in Departments:	R V College of Engineering

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	117.98 kB	Adobe PDF	View/Open
03_content.pdf		226.48 kB	Adobe PDF	View/Open
04_abstract.pdf		354.43 kB	Adobe PDF	View/Open
05_chapter 1.pdf		603.43 kB	Adobe PDF	View/Open
06_chapter 2.pdf		472.29 kB	Adobe PDF	View/Open
07_chapter 3.pdf		904.3 kB	Adobe PDF	View/Open
08_chapter 4.pdf		732.14 kB	Adobe PDF	View/Open
09_chapter 5.pdf		911.64 kB	Adobe PDF	View/Open
10_chapter 6.pdf		1.46 MB	Adobe PDF	View/Open
12_annexures.pdf		465.29 kB	Adobe PDF	View/Open
80_recommendation.pdf		266.22 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET