Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/452601
Title: An Optimization Technique to Integrate Structured and Unstructured Data in the Big Data Analytics
Researcher: Rahul Kumar
Guide(s): Wasim,Javed and Subodh Kumar
Keywords: Computer Science
Debezium
Elastic search
Kafka
Zookeeper
University: Mangalayatan University
Completed Date: 2022
Abstract: Big data explore new opportunities to modern era for discovering new information and knowledge for better understanding and rapid decision making. Big data refers to the bang of existing information. The big data movements are obsessed by the very large amounts of high-dimensional or unstructured data which is continuously generated and stored with a much cheaper cost for further use. There is massive increase in the data size in every sector. The term big data became very popular now days due to large amount of information s are added every second. With the increase of data, the efficient and critical analysis of data is required for its better understanding and decision making. In rapid growing industries lots of data is generating from different-different sources but to get proper meaningful information there is a demand of market to take all the generated data on one place. Researcher focuses on these types of issues and designs a solution architecture with the help of open-source architecture to integrate data from heterogeneous data sources on centralized repository. The researcher used data modelling before integrate to central repository, the schema registry use to maintain the data model of structured data. To model the unstructured data, the researcher developed a custom parser using ruby programming language. The focus on this study is sources of data may be any type any technology and target the researcher use elastic search as centralize repository. The architecture will work as plug in and play, means it will support any type of sources of input. As per background studies there is solution available in one architecture that integrate and analyze the structure and unstructured data with open-source tools.An architecture is designed by using open sources tools and techniques.Top of the integrated data, the researcher developed an analytics approach using the java programming language.Using proposed architecture data analytics has been significantly improved over the other contemporary approaches.
Pagination: 
URI: http://hdl.handle.net/10603/452601
Appears in Departments:Department Of Computer Science & Information Technology

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File250.2 kBAdobe PDFView/Open
02_certificate.pdf598.5 kBAdobe PDFView/Open
03_abstract.pdf697.08 kBAdobe PDFView/Open
04_declaration.pdf233.54 kBAdobe PDFView/Open
05_acknowledgement.pdf173.55 kBAdobe PDFView/Open
06_content.pdf296.66 kBAdobe PDFView/Open
07_list_of_tables.pdf26.48 kBAdobe PDFView/Open
08_list_of_figures.pdf247.18 kBAdobe PDFView/Open
09_abbreviations.pdf72.09 kBAdobe PDFView/Open
10_chapter_1.pdf6.9 MBAdobe PDFView/Open
11_chapter_2.pdf2.21 MBAdobe PDFView/Open
12_chapter_3.pdf941.53 kBAdobe PDFView/Open
13_chapter_4.pdf12.21 MBAdobe PDFView/Open
14_chapter_5.pdf1.41 MBAdobe PDFView/Open
15_chapter_6.pdf993.05 kBAdobe PDFView/Open
16_list_of_publications.pdf257.33 kBAdobe PDFView/Open
17_reference.pdf1.91 MBAdobe PDFView/Open
80_recommendation.pdf283.43 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: