Leveraging Syntactic Information for Coherent and Comprehensible Summarization

LITTON J KURISINKEL

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/386585

Title:	Leveraging Syntactic Information for Coherent and Comprehensible Summarization
Researcher:	LITTON J KURISINKEL
Guide(s):	Vasudeva Varma Kalidindi
Keywords:	Computer Science Computer Science Software Engineering Engineering and Technology
University:	International Institute of Information Technology, Hyderabad
Completed Date:	2021
Abstract:	Text summarization is a natural language processing problem which has been investigated by the NLP community for half a century. In the era of information explosion, the community has intensified research for more sophisticated methods for automated text summarization. Attempts were made in the past to frame extractive and abstractive techniques for multidocument summarization. Extractive techniques select a subset of sentences which can approximate the summary of the input corpus of documents, while abstractive summarization newlinetechniques construct a semantic representation and are expected to generate the summary in newlineits own learnt writing style. newlineExtractive techniques create an intermediate representation for the target text, capturing the newlinekey textual features. Possible approaches for intermediate representation are Topic Signatures, newlineWord frequency count, Latent Space Approaches using Matrix Factorizations, or Bayesian approaches. These intermediate representations are then used to assign scores for individual newlinelinguistic units within the text and select a subset of linguistic units which maximizes the total newlinescore as the summary of the target text. The mathematical scoring function for the summary newlineis generally composed of components to quantify topical coverage and topical diversity. They newlinereport the accuracy in terms of a measure called the ROUGE score. newlineRelatively less work is available on abstractive multi-document summarization in the past. newlineMost of them utilise sub- syntactical structures which are directly extracted from input documents to generate summary sentences. Sub syntactical structures such as phrases are reorganized to create summary sentences using a method which can ensure relevant topical newlinecoverage, topical diversity and gramaticality. They also incorporate means to ensure factual newlineaccuracy so that sentences generated by the abstract summarization system are factually correct with respect to original corpus. newlineDespite all the attempts to improve summarization in easily quantifiable dimensions
Pagination:
URI:	http://hdl.handle.net/10603/386585
Appears in Departments:	Computer Science and Engineering

Files in This Item:

File	Description	Size	Format
80_recommendation.pdf	Attached File	170.74 kB	Adobe PDF	View/Open
kjj_abstract.pdf		55.24 kB	Adobe PDF	View/Open
kjj_acknowledgemet.pdf		50.07 kB	Adobe PDF	View/Open
kjj_certificate.pdf		50.5 kB	Adobe PDF	View/Open
kjj_chapter1.pdf		107.92 kB	Adobe PDF	View/Open
kjj chapter 2.pdf		120.71 kB	Adobe PDF	View/Open
kjj_chapter 3-1.pdf		154.15 kB	Adobe PDF	View/Open
kjj_chapter 5.pdf		308.09 kB	Adobe PDF	View/Open
kjj_chapter 6.pdf		366.88 kB	Adobe PDF	View/Open
kjj_content.pdf		96.13 kB	Adobe PDF	View/Open
kjj_declaration.pdf		50.5 kB	Adobe PDF	View/Open
kjj_tables.pdf		89.19 kB	Adobe PDF	View/Open
kjj_thesis9 chapter4.pdf		448.84 kB	Adobe PDF	View/Open
kjj_title.pdf		84.91 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET