Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/296940
Title: Hybrid approaches for the analysis of relevant high quality xml web data
Researcher: Gopianand M
Guide(s): Jaganathan P
Keywords: Engineering and Technology
Computer Science
Computer Science Information Systems
XUL (Document markup language)
University: Anna University
Completed Date: 2019
Abstract: In recent days, the eXtensible Markup Language (XML) based web newlineapplications are widely used in data exchange and network services. In machine newlinelearning database, keyword search can be implemented and also it is possible on newlinegraph structure that combines the relational, html and XML data. The search newlinemechanism for XML files is a very important and essential technique to retrieve newlinethe text content from XML. But, it is difficult to identify user intentions through newlinethe keyword. In web search engine, keyword search is one of the most important newlinesearch representations for regular users. For this reason, XML language is newlinebecoming a standard in web data representation. XML supports keyword search newlineand it also allows users to create queries without the knowledge of query newlinelanguage and database schema, so that it is considered as a user-friendly method. newlineThe user can access the relevant web data by analyzing keyword in XML web. newlineIn web search engine, querying and extracting data from web has been an ongoing research issue since the birth of the web. As amount of data is increasing day by day, extracting data becomes a difficult task. In order to solve this issue we develop an efficient method for relevant XML web data quality newlineanalysis. In the initial research, the quality analysis of relevant XML web data is newlinedone using clustering and classification technique. Clustering is employed by newlineModified Fuzzy C Means (MFCM) clustering and classification by K- Nearest newlineNeighbor (KNN) algorithm. At first, a number of XML documents are collected newlineand clustered based on keyword depending on type of XML files by means of newlinemodified fuzzy c means algorithm. In order to find the relevant XML web data, newlinethe clustered features are then applied to the KNN classifier which results in newlinehigh accuracy. newline newline
Pagination: xviii, 155p.
URI: http://hdl.handle.net/10603/296940
Appears in Departments:Faculty of Science and Humanities

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File66.84 kBAdobe PDFView/Open
02_certificates.pdf200.4 kBAdobe PDFView/Open
03_abstracts.pdf6.52 kBAdobe PDFView/Open
04_acknowledgements.pdf42.13 kBAdobe PDFView/Open
05_contents.pdf20.16 kBAdobe PDFView/Open
06_listofabbreviations.pdf94.01 kBAdobe PDFView/Open
07_chapter1.pdf311.46 kBAdobe PDFView/Open
08_chapter2.pdf277.62 kBAdobe PDFView/Open
09_chapter3.pdf424.46 kBAdobe PDFView/Open
10_chapter4.pdf262.8 kBAdobe PDFView/Open
11_chapter5.pdf307.15 kBAdobe PDFView/Open
12_chapter6.pdf264.9 kBAdobe PDFView/Open
13_conclusion.pdf127.38 kBAdobe PDFView/Open
14_references.pdf138.82 kBAdobe PDFView/Open
15_listofpublications.pdf54.29 kBAdobe PDFView/Open
80_recommendation.pdf180.66 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: