Hybrid approaches for the analysis of relevant high quality xml web data

Gopianand M

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/296940

Title:	Hybrid approaches for the analysis of relevant high quality xml web data
Researcher:	Gopianand M
Guide(s):	Jaganathan P
Keywords:	Engineering and Technology Computer Science Computer Science Information Systems XUL (Document markup language)
University:	Anna University
Completed Date:	2019
Abstract:	In recent days, the eXtensible Markup Language (XML) based web newlineapplications are widely used in data exchange and network services. In machine newlinelearning database, keyword search can be implemented and also it is possible on newlinegraph structure that combines the relational, html and XML data. The search newlinemechanism for XML files is a very important and essential technique to retrieve newlinethe text content from XML. But, it is difficult to identify user intentions through newlinethe keyword. In web search engine, keyword search is one of the most important newlinesearch representations for regular users. For this reason, XML language is newlinebecoming a standard in web data representation. XML supports keyword search newlineand it also allows users to create queries without the knowledge of query newlinelanguage and database schema, so that it is considered as a user-friendly method. newlineThe user can access the relevant web data by analyzing keyword in XML web. newlineIn web search engine, querying and extracting data from web has been an ongoing research issue since the birth of the web. As amount of data is increasing day by day, extracting data becomes a difficult task. In order to solve this issue we develop an efficient method for relevant XML web data quality newlineanalysis. In the initial research, the quality analysis of relevant XML web data is newlinedone using clustering and classification technique. Clustering is employed by newlineModified Fuzzy C Means (MFCM) clustering and classification by K- Nearest newlineNeighbor (KNN) algorithm. At first, a number of XML documents are collected newlineand clustered based on keyword depending on type of XML files by means of newlinemodified fuzzy c means algorithm. In order to find the relevant XML web data, newlinethe clustered features are then applied to the KNN classifier which results in newlinehigh accuracy. newline newline
Pagination:	xviii, 155p.
URI:	http://hdl.handle.net/10603/296940
Appears in Departments:	Faculty of Science and Humanities

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	66.84 kB	Adobe PDF	View/Open
02_certificates.pdf		200.4 kB	Adobe PDF	View/Open
03_abstracts.pdf		6.52 kB	Adobe PDF	View/Open
04_acknowledgements.pdf		42.13 kB	Adobe PDF	View/Open
05_contents.pdf		20.16 kB	Adobe PDF	View/Open
06_listofabbreviations.pdf		94.01 kB	Adobe PDF	View/Open
07_chapter1.pdf		311.46 kB	Adobe PDF	View/Open
08_chapter2.pdf		277.62 kB	Adobe PDF	View/Open
09_chapter3.pdf		424.46 kB	Adobe PDF	View/Open
10_chapter4.pdf		262.8 kB	Adobe PDF	View/Open
11_chapter5.pdf		307.15 kB	Adobe PDF	View/Open
12_chapter6.pdf		264.9 kB	Adobe PDF	View/Open
13_conclusion.pdf		127.38 kB	Adobe PDF	View/Open
14_references.pdf		138.82 kB	Adobe PDF	View/Open
15_listofpublications.pdf		54.29 kB	Adobe PDF	View/Open
80_recommendation.pdf		180.66 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET