Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/452601
Title: | An Optimization Technique to Integrate Structured and Unstructured Data in the Big Data Analytics |
Researcher: | Rahul Kumar |
Guide(s): | Wasim, Javed and Subodh Kumar |
Keywords: | Computer Science; Debezium; Elastic search; Kafka; Zookeeper |
University: | Mangalayatan University |
Completed Date: | 2022 |
Abstract: | Big data opens new opportunities in the modern era for discovering information and knowledge, leading to better understanding and faster decision making. Big data refers to the explosion of existing information. The big data movement is driven by very large amounts of high-dimensional or unstructured data that are continuously generated and stored at a much lower cost for further use. Data size is increasing massively in every sector. The term big data has become very popular because large amounts of information are added every second. As data grows, efficient and critical analysis is required for better understanding and decision making. In rapidly growing industries, a great deal of data is generated from many different sources, but obtaining meaningful information requires bringing all of the generated data into one place. The researcher focuses on these issues and designs a solution architecture, using open-source technology, to integrate data from heterogeneous data sources into a centralized repository. The researcher applies data modelling before integrating data into the central repository; a schema registry is used to maintain the data model of structured data. To model the unstructured data, the researcher developed a custom parser in the Ruby programming language. The study's focus is that the data sources may be of any type and any technology, while the target is Elasticsearch, used as the centralized repository. The architecture works in a plug-and-play manner, meaning it supports any type of input source.
According to background studies, a solution is available in a single architecture that integrates and analyzes structured and unstructured data with open-source tools. The architecture is designed using open-source tools and techniques. On top of the integrated data, the researcher developed an analytics approach using the Java programming language. With the proposed architecture, data analytics improved significantly over other contemporary approaches. |
Pagination: | |
URI: | http://hdl.handle.net/10603/452601 |
Appears in Departments: | Department Of Computer Science & Information Technology |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 250.2 kB | Adobe PDF | View/Open |
02_certificate.pdf | | 598.5 kB | Adobe PDF | View/Open |
03_abstract.pdf | | 697.08 kB | Adobe PDF | View/Open |
04_declaration.pdf | | 233.54 kB | Adobe PDF | View/Open |
05_acknowledgement.pdf | | 173.55 kB | Adobe PDF | View/Open |
06_content.pdf | | 296.66 kB | Adobe PDF | View/Open |
07_list_of_tables.pdf | | 26.48 kB | Adobe PDF | View/Open |
08_list_of_figures.pdf | | 247.18 kB | Adobe PDF | View/Open |
09_abbreviations.pdf | | 72.09 kB | Adobe PDF | View/Open |
10_chapter_1.pdf | | 6.9 MB | Adobe PDF | View/Open |
11_chapter_2.pdf | | 2.21 MB | Adobe PDF | View/Open |
12_chapter_3.pdf | | 941.53 kB | Adobe PDF | View/Open |
13_chapter_4.pdf | | 12.21 MB | Adobe PDF | View/Open |
14_chapter_5.pdf | | 1.41 MB | Adobe PDF | View/Open |
15_chapter_6.pdf | | 993.05 kB | Adobe PDF | View/Open |
16_list_of_publications.pdf | | 257.33 kB | Adobe PDF | View/Open |
17_reference.pdf | | 1.91 MB | Adobe PDF | View/Open |
80_recommendation.pdf | | 283.43 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).