Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/468676
Title: | Hybrid system for recognizing Acronym expansions using heuristics And machine learning technique |
Researcher: | Menaha, R |
Guide(s): | Jayanthi, VE |
Keywords: | Engineering and Technology Engineering Engineering Electrical and Electronic Data Mining Neural Network Biomedical Abstract |
University: | Anna University |
Completed Date: | 2021 |
Abstract: | An acronym is a type of abbreviation made up of initial letter or letters of other words. An abbreviation is a short form (SF) of a phrase. The long-form (LF) of an abbreviation is called either a definition or an expansion. Abbreviations and acronyms are commonly used in biomedical literature, scientific and technical articles, information retrieval and web search, etc. Recognizing full forms that are associated with the acronym is important for identifying the meaning of an acronym that facilitates natural language processing and information retrieval from the literature. newlineSeveral research works are under practice to automate the recognition of acronym expansion pairs from text and web documents. Heuristics or Machine Learning approaches are prevalently pursued extracting acronym-definition from text or web. Existing heuristics and machine learning approaches recall rate (i.e. Number of retrieved acronym expansion pairs from document rate) is low. Hence, a hybrid model combining heuristics and machine learning is proposed in this work to retrieve more number of acronym expansion pairs from documents. The main objective of the work is to extract abbreviation definition pairs from text documents and also find the list of definitions of the acronym from the web. newlineFirstly, seven space reduction heuristics are applied to recognize acronyms from the text. Then, three mapping strategies are proposed for doing a sequence labeling task to recognize the expansion of the acronym. Since the usage of acronyms is more in biomedical literature, a biomedical dataset is created from Thalia semantic search engine. Then, the dataset is utilized to identify the potential abbreviation definition pairs. newline |
Pagination: | xv,115p. |
URI: | http://hdl.handle.net/10603/468676 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 29.48 kB | Adobe PDF | View/Open |
02_prelim pages.pdf | 3.04 MB | Adobe PDF | View/Open | |
03_content.pdf | 18.56 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 13.06 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 198.81 kB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 81.35 kB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 202.83 kB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 100.07 kB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 334.06 kB | Adobe PDF | View/Open | |
10_chapter 6.pdf | 366.74 kB | Adobe PDF | View/Open | |
11_annexures.pdf | 79.76 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 82.14 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: