Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/296813
Title: Enhanced information extraction on multiple search engines using hybrid algorithm
Researcher: Gomathi A
Guide(s): Raja K
Keywords: Engineering and Technology
Computer Science
Computer Science Information Systems
Hybrid bass
University: Anna University
Completed Date: 2019
Abstract: Currently, more than seventy percent of the people are using the newlineInternet to find and search for their desired data such as cooking recipes, newlinebeauty tips, favorite movies, symptoms of diseases, job details, etc. through newlinethe Search Engines (SEs). All the SEs are retrieving their results as a set of newlinedocuments but the most relevant results are not retrieved by the SEs to satisfy newlinethe user requirement. It is still a challenging task for the researchers. In this newlinethesis, Multiple Search Engines (MSE) concept is applied with effective newlinealgorithms to retrieve most relevant results to the input query based on the newlinecombination of the K-Means* algorithm and Particle Swarm Optimization newline(PSO).In order to retrieve the relevancy of the resultant documents, KMeans* newlinealgorithm is applied, which enhances the concept of traditional newlineclustering technique named K-Means algorithm. Unlike the K-Means newlinealgorithm newly proposed K-Means* algorithm accepts the documents from newlinevarious resources in every cluster. The SEs such as Google, Yahoo and Bing newlineare selected as the resources for this work, as they are the most commonly newlineused SEs. The Google uses its own PageRank algorithm to provide the search newlineresults, the Bing uses the Index-serving technology named Sidebar to deliver newlinethe relevant search results and the Yahoo uses a Search Engine Provider to newlineshow the search results which are referred from some other companies. These newlinethree SEs results are fused by using Segmentation Fusion (SegFuse) method newlineand provide the single result list. The final single result set is fed as input to newlinethe K-Means* cluster algorithm, which classifies the same into different newlineclusters. newline newline
Pagination: xxi, 115p.
URI: http://hdl.handle.net/10603/296813
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File26.32 kBAdobe PDFView/Open
02_certificates.pdf607.28 kBAdobe PDFView/Open
03_abstracts.pdf5.89 kBAdobe PDFView/Open
04_acknowledgements.pdf4.03 kBAdobe PDFView/Open
05_contents.pdf16.33 kBAdobe PDFView/Open
06_listofabbreviations.pdf58.75 kBAdobe PDFView/Open
07_chapter1.pdf123.95 kBAdobe PDFView/Open
08_chapter2.pdf124.99 kBAdobe PDFView/Open
09_chapter3.pdf323.73 kBAdobe PDFView/Open
10_chapter4.pdf201.96 kBAdobe PDFView/Open
11_chapter5.pdf128.44 kBAdobe PDFView/Open
12_conclusion.pdf24.58 kBAdobe PDFView/Open
13_references.pdf36.41 kBAdobe PDFView/Open
14_listofpublications.pdf18.67 kBAdobe PDFView/Open
80_recommendation.pdf125.81 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: