Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/16044
Title: | Meta data conceptual mining model using analysis of bilateral intelligence for effective text clustering |
Researcher: | Koteeswaran, S |
Guide(s): | Kannan, E |
Keywords: | Computer Science Data Mining Text Mining Naïve Levenberg Marquardt algorithm |
Upload Date: | 20-Feb-2014 |
University: | Vel Tech Dr. R R and Dr. S R Technical University |
Completed Date: | March 2013 |
Abstract: | Data mining from organizational data for extracting hidden knowledge is a growing field of study, which forms new study groups named knowledge discovery in database. Engineering applications of data mining which include, web mining, network mining, image mining and multimedia mining in general and text mining in particular. Predicting business flow, finding recent trends, sales and markets monitoring, forecasts, competition monitoring, expenditures control, revenues and administrative processes, finding volcanoes on Venus, Earth geophysics, earthquake photography from space, atmospheric science are some of the real world applications of data mining. Around 80% of data in the world are stored in unstructured textual format. Hence, the text mining and document clustering are major research areas in the past few years. Designing a effective and novel text mining requires high dimensionality, dynamic methodology, fast information access, specialized knowledge extraction from very huge data sets. Variety of approaches are proposed in the literature and these approaches vary with a range of tranditional k-means algorithm, term based methods, pattern taxonomy model and rank based methods. Existing text mining methods has many pitfalls like slow processing, lesser scalability, unable to solve conceptual problems like synonomy and polysemy. Therefore, this thesis focused text mining, proposed conceptual analysis and concentrated to solve conceptual problem like synonomy. Initially, this research work proposed metadata conceptual mining model for effective text mining. The proposed work executes in two phases of manipulation, which are training phase and testing phase. Initially in the pre-processing stage, the di-grams such as in, as, it; and tri-grams such as are, for, ing are removed from the documents. The proposed work created a data structure called, Significant Term List (STL) for each category of documents. A list of keywords on each domain of study and for each field of study is added in the concern STL. |
Pagination: | xvii, 147p. |
URI: | http://hdl.handle.net/10603/16044 |
Appears in Departments: | Department of Computer Science and Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 66.63 kB | Adobe PDF | View/Open |
02_certificate.pdf | 264.33 kB | Adobe PDF | View/Open | |
03_declaration.pdf | 268.8 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 26.59 kB | Adobe PDF | View/Open | |
05_acknowledgements.pdf | 35.49 kB | Adobe PDF | View/Open | |
06_contents.pdf | 26.08 kB | Adobe PDF | View/Open | |
07_list of tables.pdf | 18.25 kB | Adobe PDF | View/Open | |
08_list of figures.pdf | 22.83 kB | Adobe PDF | View/Open | |
09_list of abbreviations.pdf | 21.71 kB | Adobe PDF | View/Open | |
10_chapter 1.pdf | 143.1 kB | Adobe PDF | View/Open | |
11_chapter 2.pdf | 139.84 kB | Adobe PDF | View/Open | |
12_chapter 3.pdf | 194.79 kB | Adobe PDF | View/Open | |
13_chapter 4.pdf | 420.77 kB | Adobe PDF | View/Open | |
14_chapter 5.pdf | 133.4 kB | Adobe PDF | View/Open | |
15_chapter 6.pdf | 15.51 kB | Adobe PDF | View/Open | |
16_references.pdf | 67.55 kB | Adobe PDF | View/Open | |
17_publications.pdf | 14.41 kB | Adobe PDF | View/Open | |
18_biography.pdf | 11.68 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: