Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/227206
Title: Semantic web mining of unstructured data
Researcher: Manuja, Manoj
Guide(s): Garg, Deepak
Keywords: Computer Science
Semantic kernnel
Semantic Web Mining
SVM
University: Thapar Institute of Engineering and Technology
Completed Date: 2014
Abstract: Over the last couple of decades, web classification has gradually transitioned from syntax to semantic centered approach that classifies the text based on domain ontologies. These ontologies are either built manually or populated automatically using machine learning techniques. Pre-requisite condition to build such system is the availability of ontology which may be either full-fledged domain ontology or a seed ontology that can be enriched automatically. This is a dependency condition for any given semantic based text classification system. We have designed, developed and implemented a web classification system that is self-governed in terms of ontology population and does not require any pre-built ontology either full-fledged or seed. It starts from user query, build a seed ontology from it and automatically enrich it by extracting concepts from the downloaded documents only. The evaluated parameters like precision (85%), accuracy (86%), AUC (Convex) and MCC (High + ive) provide a better worth of the proposed system when compared with similar automated text classification systems. We have used Support Vector Machines (SVMs) to find similarity / dissimilarity measures among concepts and features so that similar concepts are linked together for optimal knowledge discovery. The learning system we have developed above has two components kernel machine for encapsulating the learning task and kernel function for imbibing the learning hypothesis. Linear kernel function has been used which primary exploits syntactic structures of the text. To improve the scope of knowledge extraction, we have exploited semantic kernel functions which use a-priori semantic information for knowledge extraction. Therefore, building the classification system with semantic kernel functions instead of linear kernel functions forms the next step of our research. We have tried to validate vi the performance and accuracy parameters obtained above by way of using semantic kernel function in place of linear kernel function.
Pagination: xii, 155p.
URI: http://hdl.handle.net/10603/227206
Appears in Departments:Department of Computer Science and Engineering

Files in This Item:
File Description SizeFormat 
file10(publications).pdfAttached File152.63 kBAdobe PDFView/Open
file11(references).pdf341.67 kBAdobe PDFView/Open
file1(title).pdf25.7 kBAdobe PDFView/Open
file2(certificate).pdf234.02 kBAdobe PDFView/Open
file3(preliminary pages).pdf585.87 kBAdobe PDFView/Open
file4(chapter 1).pdf303.33 kBAdobe PDFView/Open
file5(chapter 2).pdf718.97 kBAdobe PDFView/Open
file6(chapter 3).pdf493.64 kBAdobe PDFView/Open
file7(chapter 4).pdf543.86 kBAdobe PDFView/Open
file8(chapter 5).pdf512.35 kBAdobe PDFView/Open
file9(chapter 6).pdf192.49 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: