Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/227206
Title: | Semantic web mining of unstructured data |
Researcher: | Manuja, Manoj |
Guide(s): | Garg, Deepak |
Keywords: | Computer Science Semantic kernnel Semantic Web Mining SVM |
University: | Thapar Institute of Engineering and Technology |
Completed Date: | 2014 |
Abstract: | Over the last couple of decades, web classification has gradually transitioned from syntax to semantic centered approach that classifies the text based on domain ontologies. These ontologies are either built manually or populated automatically using machine learning techniques. Pre-requisite condition to build such system is the availability of ontology which may be either full-fledged domain ontology or a seed ontology that can be enriched automatically. This is a dependency condition for any given semantic based text classification system. We have designed, developed and implemented a web classification system that is self-governed in terms of ontology population and does not require any pre-built ontology either full-fledged or seed. It starts from user query, build a seed ontology from it and automatically enrich it by extracting concepts from the downloaded documents only. The evaluated parameters like precision (85%), accuracy (86%), AUC (Convex) and MCC (High + ive) provide a better worth of the proposed system when compared with similar automated text classification systems. We have used Support Vector Machines (SVMs) to find similarity / dissimilarity measures among concepts and features so that similar concepts are linked together for optimal knowledge discovery. The learning system we have developed above has two components kernel machine for encapsulating the learning task and kernel function for imbibing the learning hypothesis. Linear kernel function has been used which primary exploits syntactic structures of the text. To improve the scope of knowledge extraction, we have exploited semantic kernel functions which use a-priori semantic information for knowledge extraction. Therefore, building the classification system with semantic kernel functions instead of linear kernel functions forms the next step of our research. We have tried to validate vi the performance and accuracy parameters obtained above by way of using semantic kernel function in place of linear kernel function. |
Pagination: | xii, 155p. |
URI: | http://hdl.handle.net/10603/227206 |
Appears in Departments: | Department of Computer Science and Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
file10(publications).pdf | Attached File | 152.63 kB | Adobe PDF | View/Open |
file11(references).pdf | 341.67 kB | Adobe PDF | View/Open | |
file1(title).pdf | 25.7 kB | Adobe PDF | View/Open | |
file2(certificate).pdf | 234.02 kB | Adobe PDF | View/Open | |
file3(preliminary pages).pdf | 585.87 kB | Adobe PDF | View/Open | |
file4(chapter 1).pdf | 303.33 kB | Adobe PDF | View/Open | |
file5(chapter 2).pdf | 718.97 kB | Adobe PDF | View/Open | |
file6(chapter 3).pdf | 493.64 kB | Adobe PDF | View/Open | |
file7(chapter 4).pdf | 543.86 kB | Adobe PDF | View/Open | |
file8(chapter 5).pdf | 512.35 kB | Adobe PDF | View/Open | |
file9(chapter 6).pdf | 192.49 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: