Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/274146
Title: Feature based Extractive Text Summarization for News Documents
Researcher: Patel Darshna
Guide(s): Saurabh Shah
Keywords: Engineering and Technology,Computer Science,Computer Science Software Engineering
University: C.U. Shah University
Completed Date: 2019
Abstract: Abstract newlineNow days, the rate of growth of data is expanding exponentially and World- newlineWide Web has become one of the largest data and information repository newlinein the world. Even though the large amount of data is available, to access newlinethe required data in appropriate form at right time is a big challenge. Thus newlineResearch and Development in automatic text summarization is become more newlinesignificant. The aim of text summarization process is to take one or more newlinesource documents as input and present most salient information in condensed newlineform in a manner which can meet the user s interest and requirement. newlineThe objective of this thesis is to develop extractive text summarization newlinesystem for single and multi-document. The proposed approach uses shallow newlinesentence features and fuzzy logic system for the aforementioned work. In newlineliterature various methods have been tried to use different features to carry newlineout the summarization task. Based on the feature score of sentences, high newlinescoring sentences are included in summary. However, it is not possible to newlineclaim that any of features used in summarization yields the best coverage for newlineall genres. There are no strict rules, but there are clues that could be exploited newlineto identify important topics and ideas. Initially our research work investigates newlinedifferent clues by implementing and assessing various features and identifies newlinebest features which can yield the best result for news domain. To sum-up, the newlineobjective of this assessment is to identify the best feature set which can obtain newlinegood content coverage with minimum time. newlineNext two phases of research work, utilizes the fuzzy model to deal with newlinethe imprecise and uncertainty of feature weight for single and multi-document newlinesummarization. After pre-processing and feature extraction, feature vector newlinematrix is given as input to the fuzzy logic system. The rule based decision newlinemodule is used to provide the balance weight to the extracted features and newlineupdate the feature vector matrix. We used cosine similarity measure to calculate newlinethe similarity between senten
Pagination: 148p.
URI: http://hdl.handle.net/10603/274146
Appears in Departments:Department of Computer Engineering

Files in This Item:
File Description SizeFormat 
c e r t i f i c a t e.pdfAttached File295.34 kBAdobe PDFView/Open
chapter 1.pdf446.5 kBAdobe PDFView/Open
chapter 2.pdf190.8 kBAdobe PDFView/Open
chapter 3.pdf323.85 kBAdobe PDFView/Open
chapter 4.pdf313.9 kBAdobe PDFView/Open
chapter 5.pdf983.28 kBAdobe PDFView/Open
chapter 6.pdf654.86 kBAdobe PDFView/Open
chapter 7.pdf73.41 kBAdobe PDFView/Open
priliminary pages.pdf441.88 kBAdobe PDFView/Open
title pages.pdf92.84 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: