Novel query analysis and ontology based clustering for data management in hadoop

Pradeep, D

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/335495

Title:	Novel query analysis and ontology based clustering for data management in hadoop
Researcher:	Pradeep, D
Guide(s):	Sundar, C and Babu, P
Keywords:	Hadoop Ontology Data management
University:	Anna University
Completed Date:	2021
Abstract:	In the current information explosion era, the amount of data stored in the form of text, image, video and audio is enormous and is expected to grow in future. Moreover, the technological innovations in fields like e-libraries and epublishing is increasing the magnitude and usage of digital documents. Thus, the tools that can be used to analyze and discover useful knowledge are in great demand. In particular, the usage of data mining techniques on text documents, also known as intelligent text analysis, text data mining or Knowledge Discovery in Text (KDT) is becoming more popular. The reason behind this popularity is that most of the information (over 80%) is currently stored as text and therefore mining text information has a great commercial potential value. Document clustering helps users to effectively navigate, summarize and organize text documents into meaningful clusters, knowledge that helps to handle huge amount of text extraction. The continuous increase of computational power causes astonishing flow of data. Processing this voluminous data in a reliable and cost effective manner to draw valuable insights and business intelligence is of rapidly growing interest. Difficulties among the processes such storing, analyzing and visualizing the huge amount of datasets can be effectively handled with the big data concepts. Big data analytics can be extract hidden patterns and secret correlations, this kind of business assistances leads a greater successes to the organization. For this reason, big data implementations got to be analyzed and executed as accurately as possible. Bottleneck issues handled in the field of information retrieval are analysis of query and management of data storage. Hadoop is a large scale environment that is supported with larger storage and faster processing. Even though, it suffers from these challenging issues while the number of information requesters is higher. newline
Pagination:	p.101-111
URI:	http://hdl.handle.net/10603/335495
Appears in Departments:	Faculty of Information and Communication Engineering

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	116.93 kB	Adobe PDF	View/Open
02_certificates.pdf		185.96 kB	Adobe PDF	View/Open
03_vivaproceedings.pdf		368.8 kB	Adobe PDF	View/Open
04_bonafidecertificate.pdf		292.4 kB	Adobe PDF	View/Open
05_abstracts.pdf		92.3 kB	Adobe PDF	View/Open
06_acknowledgements.pdf		379.44 kB	Adobe PDF	View/Open
07_contents.pdf		10.18 kB	Adobe PDF	View/Open
08_listoftables.pdf		84.45 kB	Adobe PDF	View/Open
09_listoffigures.pdf		84.81 kB	Adobe PDF	View/Open
10_listofabbreviations.pdf		108.65 kB	Adobe PDF	View/Open
11_chapter1.pdf		294.11 kB	Adobe PDF	View/Open
12_chapter2.pdf		187.55 kB	Adobe PDF	View/Open
13_chapter3.pdf		206.48 kB	Adobe PDF	View/Open
14_chapter4.pdf		236.84 kB	Adobe PDF	View/Open
15_chapter5.pdf		129.23 kB	Adobe PDF	View/Open
16_conclusion.pdf		83.14 kB	Adobe PDF	View/Open
17_references.pdf		163.17 kB	Adobe PDF	View/Open
18_listofpublications.pdf		110.89 kB	Adobe PDF	View/Open
80_recommendation.pdf		102.31 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET