Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/335495
Title: Novel query analysis and ontology based clustering for data management in hadoop
Researcher: Pradeep, D
Guide(s): Sundar, C and Babu, P
Keywords: Hadoop
Ontology
Data management
University: Anna University
Completed Date: 2021
Abstract: In the current information explosion era, the amount of data stored in the form of text, image, video and audio is enormous and is expected to grow in future. Moreover, the technological innovations in fields like e-libraries and epublishing is increasing the magnitude and usage of digital documents. Thus, the tools that can be used to analyze and discover useful knowledge are in great demand. In particular, the usage of data mining techniques on text documents, also known as intelligent text analysis, text data mining or Knowledge Discovery in Text (KDT) is becoming more popular. The reason behind this popularity is that most of the information (over 80%) is currently stored as text and therefore mining text information has a great commercial potential value. Document clustering helps users to effectively navigate, summarize and organize text documents into meaningful clusters, knowledge that helps to handle huge amount of text extraction. The continuous increase of computational power causes astonishing flow of data. Processing this voluminous data in a reliable and cost effective manner to draw valuable insights and business intelligence is of rapidly growing interest. Difficulties among the processes such storing, analyzing and visualizing the huge amount of datasets can be effectively handled with the big data concepts. Big data analytics can be extract hidden patterns and secret correlations, this kind of business assistances leads a greater successes to the organization. For this reason, big data implementations got to be analyzed and executed as accurately as possible. Bottleneck issues handled in the field of information retrieval are analysis of query and management of data storage. Hadoop is a large scale environment that is supported with larger storage and faster processing. Even though, it suffers from these challenging issues while the number of information requesters is higher. newline
Pagination: p.101-111
URI: http://hdl.handle.net/10603/335495
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File116.93 kBAdobe PDFView/Open
02_certificates.pdf185.96 kBAdobe PDFView/Open
03_vivaproceedings.pdf368.8 kBAdobe PDFView/Open
04_bonafidecertificate.pdf292.4 kBAdobe PDFView/Open
05_abstracts.pdf92.3 kBAdobe PDFView/Open
06_acknowledgements.pdf379.44 kBAdobe PDFView/Open
07_contents.pdf10.18 kBAdobe PDFView/Open
08_listoftables.pdf84.45 kBAdobe PDFView/Open
09_listoffigures.pdf84.81 kBAdobe PDFView/Open
10_listofabbreviations.pdf108.65 kBAdobe PDFView/Open
11_chapter1.pdf294.11 kBAdobe PDFView/Open
12_chapter2.pdf187.55 kBAdobe PDFView/Open
13_chapter3.pdf206.48 kBAdobe PDFView/Open
14_chapter4.pdf236.84 kBAdobe PDFView/Open
15_chapter5.pdf129.23 kBAdobe PDFView/Open
16_conclusion.pdf83.14 kBAdobe PDFView/Open
17_references.pdf163.17 kBAdobe PDFView/Open
18_listofpublications.pdf110.89 kBAdobe PDFView/Open
80_recommendation.pdf102.31 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: