Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/30136
Title: | Enhanced graph based techniques for Single and multi document Summarization |
Researcher: | Hariharan S |
Guide(s): | Srinivasan R |
Keywords: | Inverse Document Frequency Term Frequency Term Occurrence World Wide Web |
Upload Date: | 8-Dec-2014 |
University: | Anna University |
Completed Date: | 01/05/2010 |
Abstract: | The World Wide Web has become one of the largest information newlineand knowledge repositories in the world Inspite of its easy access it is newlinevirtually impossible for any user to browse or read a large number of such newlineindividual documents available online Text summarization fulfils such newlineinformation seeking goals by providing a method for the user to quickly view newlinethe highlights or relevant portions of document collection With tons of newlineinformation uploaded on the web on a daily basis the task of summarizing newlinebecomes a necessity Also locating and browsing information quickly from a newlinecollection of documents within a short span of time becomes possible with the newlinehelp of summarization This has led to large scale research efforts in text newlinesummarization The issues discussed above necessitate the need for an newlineautomated summarization system The objective of this thesis is to find newlineenhancements to existing graph based methods for summarizing single newlinedocuments and multi document clusters newlineThe objective of automated text summarization is to condense the newlinegiven text to its essential contents based upon the user s choice of brevity newlineThe summarization techniques are broadly categorized into two schemes newlineextraction and abstraction Extraction involves picking up the most important newlinesentences from a document using statistical approaches To measure the newlinesimilarity among the documents, several choices are available like cosine newlinedice and jaccard Also several approaches like Term Frequency TF Term newlineOccurrence TO Inverse Document Frequency IDF and TF multiplied by newlineIDF TF IDF that would influence the content similarity are investigated in newlinethis report newline newline |
Pagination: | xx, 132p. |
URI: | http://hdl.handle.net/10603/30136 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 42.24 kB | Adobe PDF | View/Open |
02_certificate.pdf | 5.85 kB | Adobe PDF | View/Open | |
03_abstract.pdf | 14.19 kB | Adobe PDF | View/Open | |
04_acknowledgement.pdf | 7.22 kB | Adobe PDF | View/Open | |
05_content.pdf | 39.53 kB | Adobe PDF | View/Open | |
06_chapter1.pdf | 55.43 kB | Adobe PDF | View/Open | |
07_chapter2.pdf | 98.58 kB | Adobe PDF | View/Open | |
08_chapter3.pdf | 79.73 kB | Adobe PDF | View/Open | |
09_chapter4.pdf | 184.52 kB | Adobe PDF | View/Open | |
10_chapter5.pdf | 186.96 kB | Adobe PDF | View/Open | |
11_chapter6.pdf | 103.62 kB | Adobe PDF | View/Open | |
12_chapter7.pdf | 8.85 kB | Adobe PDF | View/Open | |
13_reference.pdf | 56.73 kB | Adobe PDF | View/Open | |
14_publication.pdf | 6.84 kB | Adobe PDF | View/Open | |
15_vitae.pdf | 5.99 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: