Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/261946
Title: | Efficient text pattern mining and clustering approach for record retrieval using gso based prefix span and improved k m eans algorithm |
Researcher: | Rajesh Kumar A |
Guide(s): | Sasikala R |
Keywords: | Clustering Data mining Engineering and Technology,Computer Science,Computer Science Information Systems |
University: | Anna University |
Completed Date: | 2018 |
Abstract: | Data mining analyses a large number of observational data sets,finds unsuspected relationships and summarizes the data in novel ways thatare both understandable and useful for the user. The wide-spread use ofdistributed information systems leads to the construction of large datacollections in various fields. Many data mining techniques have beenproposed for mining useful patterns in text documents. However, how toeffectively use and update discovered patterns is still an open research issue,especially in the domain of text mining. Since most existing text miningmethods adopted term-based approaches, they all suffer from the problems ofpolysemy and synonymy. Over the years, people have often held thehypothesis that pattern (or phrase)-based approaches should perform betterthan the term-based ones, but many experiments do not support this newlinehypothesis.The proposed method is performed on the records, based on the twomain phases, which are training and testing phases. In the training phase: 1)applying prefix span algorithm, 2) length and width constraints, 3) Optimalmining via Group Search Optimization (GSO). We first present the concept ofprefix span, which detects the frequent pattern using prefix tree. Based on thisprefix tree, length and width constraints are applied to handle restrictions. newline newline |
Pagination: | xx,157p. |
URI: | http://hdl.handle.net/10603/261946 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 17.37 kB | Adobe PDF | View/Open |
02_certificates.pdf | 584.4 kB | Adobe PDF | View/Open | |
03_abstract.pdf | 66.35 kB | Adobe PDF | View/Open | |
04_acknowledgement.pdf | 72.09 kB | Adobe PDF | View/Open | |
05_contents.pdf | 108.64 kB | Adobe PDF | View/Open | |
06_list_of_symbols_and_abbreviations.pdf | 66.34 kB | Adobe PDF | View/Open | |
07_chapter1.pdf | 154.31 kB | Adobe PDF | View/Open | |
08_chapter2.pdf | 217.08 kB | Adobe PDF | View/Open | |
09_chapter3.pdf | 146.02 kB | Adobe PDF | View/Open | |
10_chapter4.pdf | 296.27 kB | Adobe PDF | View/Open | |
11_chapter5.pdf | 246.82 kB | Adobe PDF | View/Open | |
12_chapter6.pdf | 248.58 kB | Adobe PDF | View/Open | |
13_chapter7.pdf | 79.55 kB | Adobe PDF | View/Open | |
14_references.pdf | 127.56 kB | Adobe PDF | View/Open | |
15_publications.pdf | 72.72 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: