Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/296813
Title: | Enhanced information extraction on multiple search engines using hybrid algorithm |
Researcher: | Gomathi A |
Guide(s): | Raja K |
Keywords: | Engineering and Technology Computer Science Computer Science Information Systems Hybrid bass |
University: | Anna University |
Completed Date: | 2019 |
Abstract: | Currently, more than seventy percent of the people are using the newlineInternet to find and search for their desired data such as cooking recipes, newlinebeauty tips, favorite movies, symptoms of diseases, job details, etc. through newlinethe Search Engines (SEs). All the SEs are retrieving their results as a set of newlinedocuments but the most relevant results are not retrieved by the SEs to satisfy newlinethe user requirement. It is still a challenging task for the researchers. In this newlinethesis, Multiple Search Engines (MSE) concept is applied with effective newlinealgorithms to retrieve most relevant results to the input query based on the newlinecombination of the K-Means* algorithm and Particle Swarm Optimization newline(PSO).In order to retrieve the relevancy of the resultant documents, KMeans* newlinealgorithm is applied, which enhances the concept of traditional newlineclustering technique named K-Means algorithm. Unlike the K-Means newlinealgorithm newly proposed K-Means* algorithm accepts the documents from newlinevarious resources in every cluster. The SEs such as Google, Yahoo and Bing newlineare selected as the resources for this work, as they are the most commonly newlineused SEs. The Google uses its own PageRank algorithm to provide the search newlineresults, the Bing uses the Index-serving technology named Sidebar to deliver newlinethe relevant search results and the Yahoo uses a Search Engine Provider to newlineshow the search results which are referred from some other companies. These newlinethree SEs results are fused by using Segmentation Fusion (SegFuse) method newlineand provide the single result list. The final single result set is fed as input to newlinethe K-Means* cluster algorithm, which classifies the same into different newlineclusters. newline newline |
Pagination: | xxi, 115p. |
URI: | http://hdl.handle.net/10603/296813 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 26.32 kB | Adobe PDF | View/Open |
02_certificates.pdf | 607.28 kB | Adobe PDF | View/Open | |
03_abstracts.pdf | 5.89 kB | Adobe PDF | View/Open | |
04_acknowledgements.pdf | 4.03 kB | Adobe PDF | View/Open | |
05_contents.pdf | 16.33 kB | Adobe PDF | View/Open | |
06_listofabbreviations.pdf | 58.75 kB | Adobe PDF | View/Open | |
07_chapter1.pdf | 123.95 kB | Adobe PDF | View/Open | |
08_chapter2.pdf | 124.99 kB | Adobe PDF | View/Open | |
09_chapter3.pdf | 323.73 kB | Adobe PDF | View/Open | |
10_chapter4.pdf | 201.96 kB | Adobe PDF | View/Open | |
11_chapter5.pdf | 128.44 kB | Adobe PDF | View/Open | |
12_conclusion.pdf | 24.58 kB | Adobe PDF | View/Open | |
13_references.pdf | 36.41 kB | Adobe PDF | View/Open | |
14_listofpublications.pdf | 18.67 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 125.81 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: