Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/252990
Title: Multi level and multi attribute content depth measure based web document clustering for efficient web search
Researcher: Sureshkumar T
Guide(s): Shanthi N
Keywords: Clustering
Depth measure
Engineering and Technology,Computer Science,Computer Science Information Systems
University: Anna University
Completed Date: 2018
Abstract: Users of web search engines are habitually forced to examine newlinethrough the long ordered list of documents reverted by the engines. The newlineInformation Retrieval (IR) community has surveyed document clustering as newlinean apparent method of organizing retrieval results. The huge increase in newlineamount of information present on web, positions new challenges in clustering newlineregarding the underlying data model and nature of clustering algorithm. Many newlineconventional document clustering techniques mostly rely on single term newlineanalysis of document data set. Clustering similar documents from a large collection is not a one step task. It involves multiple stages which generally comprise three main newlinephases: feature extraction and selection, document representation, and newlineclustering. To achieve more accurate document clustering, more informative newlinefeatures such as topic, structure, layout, visual, content, semantic are essential newlineto be extracted in order to achieve efficient clusters of Web documents. The newlinemajor key requirements for web document clustering methods are relevance, newlinebrowsable summaries, overlap, snippet tolerance, speed, scaling and so on. newlineThe aim of this thesis is to improve the efficiency and accuracy of web newlinedocument clustering by considering the features that are extracted along with newlinecontent depth of the document. newline newline
Pagination: xix, 123p.
URI: http://hdl.handle.net/10603/252990
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File49.12 kBAdobe PDFView/Open
02_certificates.pdf1.12 MBAdobe PDFView/Open
03_abstract.pdf44.7 kBAdobe PDFView/Open
04_acknowledgment.pdf44.9 kBAdobe PDFView/Open
05_contents.pdf62.2 kBAdobe PDFView/Open
06_chapter1.pdf196.37 kBAdobe PDFView/Open
07_chapter2.pdf115.36 kBAdobe PDFView/Open
08_chapter3.pdf175.31 kBAdobe PDFView/Open
09_chapter4.pdf138.95 kBAdobe PDFView/Open
10_chapter5.pdf149.82 kBAdobe PDFView/Open
11_chapter6.pdf57.03 kBAdobe PDFView/Open
12_conclusion.pdf49.65 kBAdobe PDFView/Open
13_references.pdf111.23 kBAdobe PDFView/Open
14_publications.pdf46.07 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: