Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/522333
Title: | An enhanced big data processing model for drill down search engine with content mining crawler |
Researcher: | Ragavan, N |
Guide(s): | Yesubai Rubavathi, C |
Keywords: | Big data Computer Science Computer Science Software Engineering Content mining Down search engine Engineering and Technology |
University: | Anna University |
Completed Date: | 2023 |
Abstract: | Big Data is the current key topic in today s emerging technology. Right now, the top most popular search engines like Google, Yahoo are offering rank based search features in which the end users need to page navigate to find the exact search results. There is no Drill down (multi-level) search feature. Also the search results of existing search engines are not organized in a category based view. Thus conventional Search Engines lack Category feature as well as drill down search feature. To find the exact search results among the vast paginated search results, Drill down search would be beneficial for the end user. Category based search feature with higher level of drill down search leads to get accurate search results. newlineExisting Search Engine providers (Google, Yahoo) are implementing their own custom file system in order to store the crawler big data efficiently. Google developed its own file system called Google File System (GFS). These search engine providers store the big data in fixed-size chunks of 64 megabytes. These files are extremely rarely overwritten or shrunk and are usually appended to or read. When going to implement a drill down search engine with category based search results view, it is not possible to have a fixed chunk of files. Implementing drill down search with category based search view increases the number and size of the big data file system during the crawling process. This will lead to big problems for search engine organizations about how to store their huge amount of information and indexing such data. Traditional disk storage analytics may not be able to handle such large quantities of big data. New storage technologies are not in the near future. Next generation search engines would be category based and offer multi-level search capability. The purpose of this dissertation is to implement a big data storage reduction file system model that supports Drill down search feature with category based search results view using content mining crawler. Developing a good file system model fo |
Pagination: | xv,116p. |
URI: | http://hdl.handle.net/10603/522333 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 175.36 kB | Adobe PDF | View/Open |
02_prelim pages.pdf | 5.69 MB | Adobe PDF | View/Open | |
03_content.pdf | 148.62 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 131.25 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 439.18 kB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 309.36 kB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 553.34 kB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 784.95 kB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 957.98 kB | Adobe PDF | View/Open | |
10_chapter 6.pdf | 1.19 MB | Adobe PDF | View/Open | |
11_chapter 7.pdf | 911.74 kB | Adobe PDF | View/Open | |
12_annexures.pdf | 174.37 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 177.56 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: