Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/521720
Title: | Scaling real time processing using in memory computing for big data |
Researcher: | Kumar, Vivek |
Guide(s): | Mishra, Vinay Kumar |
Keywords: | Computer Science Computer Science Hardware and Architecture Engineering and Technology High performance computing |
University: | Dr. A.P.J. Abdul Kalam Technical University |
Completed Date: | 2023 |
Abstract: | Big Data emerged as a field of research after data mining. Big Data has three basic properties: volume, velocity, and variety. Data Stream is a stream of data generated or passed through various sources. Data Stream when combined with velocity of big data becomes Elephant flows. The streaming data can be stored in data lakes for a limited time, after which data overflows. The mentioned limitation motivated us to use in-memory computing technique. The research deals with: (i) the optimization techniques of big data volume and velocity, (ii) the visualization techniques of big data volume and velocity, (iii) the resource aware parallel computing techniques of big data volume and velocity, and (iv) the framework for high performance computing of big data volume and velocity. newlineThe principle of parallelism is employed to accelerate stream data computing. A framework, Mille Cheval Framework, is proposed and tested. Mille Cheval Framework is a GPU based in-memory High-Performance-Computing framework for accelerated processing of big-data streams. French words Mille and Cheval translates to Thousand Horses in English language and Sahastra Ashwa in Hindi language, respectively. Streams are temporally ordered, rapidly changing, ample in volume, and infinite in nature. It is nearly impossible to store the entire data stream due to its large volume and high velocity. GPU based High-Performance Computing (HPC) framework is proposed for accelerated processing of big-data streams using the in-memory data structure. We have implemented three parallel algorithms to prove the viability of the framework. The contributions of Mille Cheval are: (i) the viability of streaming on accelerators to increase throughput, (ii) carefully chosen hash algorithms to achieve low collision rate and high randomness, and (iii) memory sketches for approximation. The objective is to leverage the power of a single node using in-memory computing and hybrid computing. |
Pagination: | |
URI: | http://hdl.handle.net/10603/521720 |
Appears in Departments: | Dean P.G.S.R |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
80_recommendation.pdf | Attached File | 246.34 kB | Adobe PDF | View/Open |
abstract.pdf | 275.9 kB | Adobe PDF | View/Open | |
annexures.pdf | 786.9 kB | Adobe PDF | View/Open | |
chapter 1.pdf | 579.14 kB | Adobe PDF | View/Open | |
chapter 2.pdf | 412.58 kB | Adobe PDF | View/Open | |
chapter 3.pdf | 1.7 MB | Adobe PDF | View/Open | |
chapter 4.pdf | 626.09 kB | Adobe PDF | View/Open | |
chapter 5.pdf | 1.56 MB | Adobe PDF | View/Open | |
chapter 6.pdf | 725.52 kB | Adobe PDF | View/Open | |
chapter 7.pdf | 2.87 MB | Adobe PDF | View/Open | |
content.pdf | 223.55 kB | Adobe PDF | View/Open | |
prelim pages.pdf | 297.34 kB | Adobe PDF | View/Open | |
title.pdf | 114.38 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: