Optimized algorithms for classification of text documents

Maruthupandi, J

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/342607

Title:	Optimized algorithms for classification of text documents
Researcher:	Maruthupandi, J
Guide(s):	Vimala Devi, K
Keywords:	Engineering and Technology; Computer Science; Computer Science Software Engineering; Fuzzy text; Text mining; HABBFO
University:	Anna University
Completed Date:	2019
Abstract:	Managing and using massive volumes of data has emerged as an active research area because it poses numerous challenges, and effective strategies are required to make text mining more effective. In the first approach, a fuzzy text classification algorithm is proposed for categorizing a given set of documents. Fuzzy methods are adopted to reduce the dimensionality problem: high-dimensional documents are transformed into low-dimensional fuzzy relevance vectors. The entire space is partitioned into sub-regions, which are then combined to form distinct categories. The experimental results confirmed that this system offers better speed and efficacy. In the second work, a wrapper-based Hybrid Artificial Bee Colony and Bacterial Foraging Optimization (HABBFO) framework is proposed to select the most relevant feature subset for prediction. Pre-processing steps, namely a) tokenization, b) stop-word removal, and c) stemming, are performed to extract features. Several experiments were conducted, and the proposed system was observed to outperform other existing work in the domain of Feature Selection (FS). In the third approach, a novel multi-perspective similarity framework is developed to maximize the accuracy and performance of similarity measures between two documents and between document sets. The proposed measures consider three cases: the first is when the feature is present in both documents, the second is when the feature is present in only one document, and the third is when the feature is present in neither document. The similarity between two document sets is also evaluated. The efficacy of the proposed similarity measure was verified on different real-world datasets for text applications, including single-level classification and k-means clustering.
Pagination:	xix,173 p.
URI:	http://hdl.handle.net/10603/342607
Appears in Departments:	Faculty of Information and Communication Engineering
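
Illustrative note on the abstract: the pre-processing steps (tokenization, stop-word removal, stemming) and the three feature cases considered by the similarity measure can be sketched as below. This is only a minimal sketch under stated assumptions, not the thesis's implementation: the stop-word list, the suffix-stripping stemmer, and the function names `preprocess` and `three_case_counts` are all hypothetical, and the abstract does not say how the three cases are weighted into a final similarity score, so the sketch only counts them.

```python
import re

# Hypothetical, deliberately tiny stop-word list (an assumption for illustration).
STOP_WORDS = {"a", "an", "and", "the", "is", "in", "of", "on", "to"}


def naive_stem(token):
    """Strip a few common English suffixes; a crude stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Tokenize, drop stop words, and stem; returns the set of feature terms."""
    tokens = re.findall(r"[a-z]+", text.lower())      # a) tokenization
    kept = [t for t in tokens if t not in STOP_WORDS]  # b) stop-word removal
    return {naive_stem(t) for t in kept}               # c) stemming


def three_case_counts(doc_a, doc_b, vocabulary):
    """Count vocabulary features falling into each of the three cases."""
    fa, fb = preprocess(doc_a), preprocess(doc_b)
    return {
        "both": len(vocabulary & fa & fb),       # case 1: present in both documents
        "one": len(vocabulary & (fa ^ fb)),      # case 2: present in exactly one
        "neither": len(vocabulary - fa - fb),    # case 3: present in neither
    }


vocab = {"cat", "dog", "chase", "mice", "bird"}
counts = three_case_counts("Cats chase mice", "Dogs chase cats", vocab)
print(counts)
```

The vocabulary is passed explicitly because the third case (a feature absent from both documents) is only defined relative to some wider feature set, such as the terms of the whole collection.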

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	96.19 kB	Adobe PDF
02_certificates.pdf		42.35 kB	Adobe PDF
03_vivaproceedings.pdf		92.29 kB	Adobe PDF
04_bonafidecertificate.pdf		50.7 kB	Adobe PDF
05_abstracts.pdf		65.03 kB	Adobe PDF
06_acknowledgements.pdf		59.55 kB	Adobe PDF
07_contents.pdf		75.62 kB	Adobe PDF
08_listoftables.pdf		94.33 kB	Adobe PDF
09_listoffigures.pdf		92.86 kB	Adobe PDF
10_listofabbreviations.pdf		56.29 kB	Adobe PDF
11_chapter1.pdf		365.59 kB	Adobe PDF
12_chapter2.pdf		364.01 kB	Adobe PDF
13_chapter3.pdf		422.89 kB	Adobe PDF
14_chapter4.pdf		400.89 kB	Adobe PDF
15_chapter5.pdf		531.33 kB	Adobe PDF
16_chapter6.pdf		575.2 kB	Adobe PDF
17_chapter7.pdf		84.18 kB	Adobe PDF
18_conclusion.pdf		84.18 kB	Adobe PDF
19_references.pdf		193.03 kB	Adobe PDF
20_listofpublications.pdf		67.26 kB	Adobe PDF
80_recommendation.pdf		101.8 kB	Adobe PDF

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Shodhganga : a reservoir of Indian theses @ INFLIBNET