Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/261946
Title: Efficient text pattern mining and clustering approach for record retrieval using gso based prefix span and improved k m eans algorithm
Researcher: Rajesh Kumar A
Guide(s): Sasikala R
Keywords: Clustering
Data mining
Engineering and Technology,Computer Science,Computer Science Information Systems
University: Anna University
Completed Date: 2018
Abstract: Data mining analyses a large number of observational data sets,finds unsuspected relationships and summarizes the data in novel ways thatare both understandable and useful for the user. The wide-spread use ofdistributed information systems leads to the construction of large datacollections in various fields. Many data mining techniques have beenproposed for mining useful patterns in text documents. However, how toeffectively use and update discovered patterns is still an open research issue,especially in the domain of text mining. Since most existing text miningmethods adopted term-based approaches, they all suffer from the problems ofpolysemy and synonymy. Over the years, people have often held thehypothesis that pattern (or phrase)-based approaches should perform betterthan the term-based ones, but many experiments do not support this newlinehypothesis.The proposed method is performed on the records, based on the twomain phases, which are training and testing phases. In the training phase: 1)applying prefix span algorithm, 2) length and width constraints, 3) Optimalmining via Group Search Optimization (GSO). We first present the concept ofprefix span, which detects the frequent pattern using prefix tree. Based on thisprefix tree, length and width constraints are applied to handle restrictions. newline newline
Pagination: xx,157p.
URI: http://hdl.handle.net/10603/261946
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File17.37 kBAdobe PDFView/Open
02_certificates.pdf584.4 kBAdobe PDFView/Open
03_abstract.pdf66.35 kBAdobe PDFView/Open
04_acknowledgement.pdf72.09 kBAdobe PDFView/Open
05_contents.pdf108.64 kBAdobe PDFView/Open
06_list_of_symbols_and_abbreviations.pdf66.34 kBAdobe PDFView/Open
07_chapter1.pdf154.31 kBAdobe PDFView/Open
08_chapter2.pdf217.08 kBAdobe PDFView/Open
09_chapter3.pdf146.02 kBAdobe PDFView/Open
10_chapter4.pdf296.27 kBAdobe PDFView/Open
11_chapter5.pdf246.82 kBAdobe PDFView/Open
12_chapter6.pdf248.58 kBAdobe PDFView/Open
13_chapter7.pdf79.55 kBAdobe PDFView/Open
14_references.pdf127.56 kBAdobe PDFView/Open
15_publications.pdf72.72 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: