Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/393258
Title: Framework for Analysing Diverse Patterns on Web and Social Media Through Data Analytics
Researcher: Gnanasambandan, P
Guide(s): Kumaravel, A
Keywords: Computer Science
Computer Science Interdisciplinary Applications
Engineering and Technology
University: Bharath University
Completed Date: 2022
Abstract: The advancement of internet technology and the dynamic nature of World Wide Web attract large number of users for publishing and retrieving information. But due to the heterogeneous and huge quantity of data, most of the information on the web is uninteresting to the users. Thus developing effective algorithm for retrieving the relevant information without accessing the complete data; at the outset, it has become an important concern among the Web mining research communities. Though most researchers focused their research work in this area, still their focus is only on retrieving similar patterns by leaving dissimilar patterns which are likely to contain the outlying data. This work concentrates on mining web content outliers which extracts the dissimilar web document taken from a group of documents of the same domain. Moreover, mining web content outliers helps in promoting business activities and improving the quality of the search results. In this work, a novel mathematical approach based on proportionate method is developed for retrieving relevant web document through outlier detection technique. The removal of outlaid documents improves the quality of search results catering to the user needs. Experimental results proved that this method gives better results in terms of accuracy, recall and specificity than the existing approach. Web usage outlier mining is dedicated to find usage patterns which differ significantly from the rest of the web documents taken from the web server log files. Shifting through the unstructured and ever growing web data for outliers is more challenging than finding outliers in numeric datasets. The existing web mining algorithms concentrate on finding similar patterns leaving dissimilar patterns that are likely to contain outlying data such as exceptions, noise, irrelevant and rare patterns. Analysis on these diverse patterns on the web plays a crucial role in identifying competitors in business, detecting frauds in banking activities, detecting network intrusion, and spam filtering
Pagination: 
URI: http://hdl.handle.net/10603/393258
Appears in Departments:Department of Information Technology

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File201.83 kBAdobe PDFView/Open
02_declaration.pdf270.37 kBAdobe PDFView/Open
03_certificate.pdf270.05 kBAdobe PDFView/Open
04_acknowledgement.pdf272.61 kBAdobe PDFView/Open
05_content.pdf208.07 kBAdobe PDFView/Open
06_list of tables and figure.pdf203.98 kBAdobe PDFView/Open
07_abstract.pdf183.12 kBAdobe PDFView/Open
08_chapter 1.pdf535.55 kBAdobe PDFView/Open
09_chapter 2.pdf342.91 kBAdobe PDFView/Open
10_chapter 3.pdf367.75 kBAdobe PDFView/Open
11_chapter 4.pdf588.39 kBAdobe PDFView/Open
12_chapter 5.pdf819.11 kBAdobe PDFView/Open
13_chapter 6.pdf963.17 kBAdobe PDFView/Open
14_chapter 7.pdf562.21 kBAdobe PDFView/Open
15_chapter 8.pdf357.57 kBAdobe PDFView/Open
16_chapter 9.pdf221.94 kBAdobe PDFView/Open
80_recommendation.pdf423.35 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: