Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/303814
Title: Certain investigations on machine learning techniques for text summarization
Researcher: Priya V
Guide(s): Umamaheswari K
Keywords: Engineering and Technology
Computer Science
Computer Science Information Systems
Machine learning
Learning classifier systems
Explanation-based learning
University: Anna University
Completed Date: 2019
Abstract: Creating text summaries from large volume of unstructured text like customer reviews web log posts and social media posts is an important task in text mining applications These summaries reveal the useful information which portrays the entire document or reviews Summarization task could be performed using two major approaches Extractive and abstractive approach Extractive approach identifies the significant portion of the document that exposes the entire content and extracts it to form summary Abstractive approach creates summary based on keywords and semantics from the document which makes it difficult when compared to the other approach The extractive summarization task is complex due to redundancy large volume of text variability and semantics of natural language in the text The wide applicability as well as challenging nature of the task has inspired active research in the domain by both academic and industry experts This research focuses on the design of machine learning based systems for two core areas related to text mining namely Feature based text summarization and text similarity detection Existing feature ranking and summarization systems employ a variety of methods including latent semantic indexing Naïve Bayes and other semantics based approaches Due to the complexity of the task there is a need for developing efficient systems In this thesis a feature ranking system based on customer preferences have been developed Three different machine learning approaches have been adopted for feature based micro level extractive text summary formation In the feature ranking system features are extracted using standard dependency parsing algorithm newline
Pagination: xviii,153.
URI: http://hdl.handle.net/10603/303814
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File26.69 kBAdobe PDFView/Open
02_certificates.pdf286.5 kBAdobe PDFView/Open
03_abstracts.pdf83.52 kBAdobe PDFView/Open
04_acknowledgements.pdf44.69 kBAdobe PDFView/Open
05_contents.pdf83.08 kBAdobe PDFView/Open
06_list_of_tables.pdf82.26 kBAdobe PDFView/Open
07_list_of_figures.pdf54.16 kBAdobe PDFView/Open
08_list_of_abbreviations.pdf43.76 kBAdobe PDFView/Open
09_chapter1.pdf119.18 kBAdobe PDFView/Open
10_chapter2.pdf119.5 kBAdobe PDFView/Open
11_chapter3.pdf280.94 kBAdobe PDFView/Open
12_chapter4.pdf390.03 kBAdobe PDFView/Open
13_chapter5.pdf208.91 kBAdobe PDFView/Open
14_conclusion.pdf89.14 kBAdobe PDFView/Open
15_appendices.pdf57.08 kBAdobe PDFView/Open
16_references.pdf110.88 kBAdobe PDFView/Open
17_list_of_publications.pdf99.79 kBAdobe PDFView/Open
80_recommendation.pdf101.62 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: