Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/120959
Title: | Towards Intelligent Text Mining Under Limited Linguistic Resources |
Researcher: | Niraj Kumar |
Guide(s): | Dr. Kannan Srinathan, Dr. Vasudeva Varma |
Keywords: | Automatic Question Answering Automatic Summarization Evaluation Document Clustering Document Summarization Information Retrieval Keyphrase Extraction Text Mining |
University: | International Institute of Information Technology, Hyderabad |
Completed Date: | 03/06/2015 |
Abstract: | The establishment of new techniques or improvements in established core techniques to extract knowledge from the text document(s) by using limited linguistic resources is a challenging task of significant interest. The demand of such techniques is due to (1) the heavy increase in the size and variety of text resources, (2) the continuous arrival of text resources having different languages and different levels of computational capabilities and (3) the increase in the demand of variety of information needs. newline newline The graph based automated text analysis and text mining methods have received a great deal of attention in solving these issues. Actually, an important aspect of graph-based method is that it does not require deep linguistic knowledge, nor domain or language specific annotated corpora, which makes it highly portable to other domains, genres, or languages. The development of advanced graph theoretical techniques for social media mining has also enriched this area. newline newlineBased on the above discussed facts, we have identified some core issues (and techniques for them) like: (i) meaningful phrase identification (ii) differentiating role and sense of words, preferably via a single measure, (iii) handling information gap at the phrase level by using unsupervised scheme, (iv) integrating the importance of words as a core feature and (v) identifying group semantics and/or logically related features, (vi) sentence abstraction and so on. newline newlineThese techniques are very useful for multiple text mining applications like: (a) Document summarization, (b) Summarization Evaluation, (c) Document Clustering, (d) Key phrase Extraction and (e) Automatic Question Answering. The effective improvement in the results of our devised applications, over state-of-the-arts supervised, unsupervised applications, which use linguistic support and domain knowledge etc., prove the effectiveness of the proposed techniques. |
Pagination: | xvi, 160 |
URI: | http://hdl.handle.net/10603/120959 |
Appears in Departments: | Computer Science and Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 87.9 kB | Adobe PDF | View/Open |
02_certificate.pdf | 145.5 kB | Adobe PDF | View/Open | |
03_acknowledgements.pdf | 67.74 kB | Adobe PDF | View/Open | |
04_contents.pdf | 111.42 kB | Adobe PDF | View/Open | |
05_preface.pdf | 203.78 kB | Adobe PDF | View/Open | |
06_list of tables figures.pdf | 127.8 kB | Adobe PDF | View/Open | |
07_chapter 1.pdf | 528.35 kB | Adobe PDF | View/Open | |
08_chapter 2.pdf | 402.09 kB | Adobe PDF | View/Open | |
09_chapter 3.pdf | 749.27 kB | Adobe PDF | View/Open | |
10_chapter 4.pdf | 482.23 kB | Adobe PDF | View/Open | |
11_chapter 5.pdf | 496 kB | Adobe PDF | View/Open | |
12_chapter 6.pdf | 766.41 kB | Adobe PDF | View/Open | |
13_chapter 7.pdf | 540.51 kB | Adobe PDF | View/Open | |
14_chapter 8.pdf | 616.25 kB | Adobe PDF | View/Open | |
15_chapter 9.pdf | 230.37 kB | Adobe PDF | View/Open | |
16_references.pdf | 166.5 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: