Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/264844
Title: | Efficient Computing Of Big Data Harmonization |
Researcher: | Jigna Ashish Patel |
Guide(s): | Praiyanka Sharma |
Keywords: | Big data, OLAP, Data harmonization, OOHI, Engineering and Technology, Engineering |
University: | Gujarat Technological University |
Completed Date: | 2019 |
Abstract: | With the expansion of the internet, social media, the Internet of Things and advanced technology in healthcare, infrastructure, agriculture, education, science and data analytics, data generation has grown exponentially. In a world of exploding data, storage and speed have become pressing issues. Big Data has existed for a long time, but the rise of social media has made it widely known. Cost-effective and innovative methods for processing information to support good decision making are in demand, and academia and industry work together on cost-effective solutions to manage, process and analyze Big Data. Big Data harmonization is the process of providing a single platform for heterogeneous data of many varieties. Extraction, transformation and loading (ETL) is the essential step in building a data warehouse. Data harmonization is an alternate name for the data warehouse, which provides a common level of granularity; it is the base platform on which OLAP servers and data analytics work. Computing OLAP over Big Data poses many challenges, including data scaling, processing speed, storage and query performance. The main operations performed on the data are roll-up, drill-down, slice and dice. In the Big Data era, ROLAP (Relational Online Analytical Processing) and HOLAP (Hybrid Online Analytical Processing) take more space because of costly join operations and take more time to evaluate queries. This thesis adopts MOLAP (Multidimensional OLAP). The main focus is on two Big Data challenges for OLAP: storage and velocity. To work effectively and in parallel on the volume of Big Data, a distributed environment was implemented. The author proposes and implements a technique named OOHI (OLAP on Hadoop by Indexing) that offers a simplified and efficient multidimensional model.
The overall work of OOHI is divided into the Data Loading Module, Data Storage Module, Dimension Encoding Module, Dimension traversal M |
URI: | http://hdl.handle.net/10603/264844 |
Appears in Departments: | Computer/IT Engineering |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
01_title page.pdf | Attached File | 100.25 kB | Adobe PDF |
02_declaration.pdf | Attached File | 123.92 kB | Adobe PDF |
03_certificate.pdf | Attached File | 44.73 kB | Adobe PDF |
04_abstract.pdf | Attached File | 72.12 kB | Adobe PDF |
05_acknowledgement.pdf | Attached File | 101.92 kB | Adobe PDF |
06_table of content.pdf | Attached File | 121.38 kB | Adobe PDF |
07_list of abbreviations.pdf | Attached File | 26.75 kB | Adobe PDF |
08_list of figures.pdf | Attached File | 44.54 kB | Adobe PDF |
09_list of tables.pdf | Attached File | 21.25 kB | Adobe PDF |
10_chapter 1.pdf | Attached File | 165.88 kB | Adobe PDF |
11_chapter 2.pdf | Attached File | 352.76 kB | Adobe PDF |
12_chapter 3.pdf | Attached File | 400.79 kB | Adobe PDF |
13_chapter 4.pdf | Attached File | 707.24 kB | Adobe PDF |
14_chapter 5.pdf | Attached File | 1.17 MB | Adobe PDF |
15_chapter 6.pdf | Attached File | 161.28 kB | Adobe PDF |
16_references.pdf | Attached File | 253.14 kB | Adobe PDF |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).