Improved big data privacy and security using hybrid elliptic curve cryptography with pillar K means clustering algorithm in secured map reduce layer

Arogya Presskila X

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/522606

Title:	Improved big data privacy and security using hybrid elliptic curve cryptography with pillar K means clustering algorithm in secured map reduce layer
Researcher:	Arogya Presskila X
Guide(s):	Ramesh K
Keywords:	Computer Science; Computer Science Information Systems; Engineering and Technology; Hadoop Framework; Pillar K-Means Clustering; Swarm Optimization
University:	Anna University
Completed Date:	2023
Abstract:	The traditional Big Data framework does not clearly address the sensitivity of structured and unstructured data such as health care records, personal information, and online transaction data. In addition, privacy and security need to be incorporated in the cluster nodes of the Map Reduce Layer of the Hadoop framework, which is vulnerable and prone to attack. Accordingly, the threat of revealing personal data has been elevated in recent years. The proposed work introduces a Secured Map Reduce (SMR) Layer in the Big Data framework to improve the security and privacy of sensitive data through a hybrid approach that integrates Particle Swarm Optimization (PSO) with Elliptic Curve Cryptography (ECC). The framework chooses the optimal private key for authentication using PSO and protects the data using ECC. PSO uses the global best fitness value to find the optimal private key for efficient authentication. The proposed methodology focuses on the storage of Big Data. In the first phase, the Hadoop Distributed File System (HDFS) is used to store large volumes of data efficiently while ensuring security and preserving privacy as the data is processed in the Map Reduce Layer. Efficient storage in HDFS is achieved by clustering, which segregates related data and groups similar data into blocks that are stored on the distributed nodes of HDFS for ease of processing and privacy preservation. This study uses a semi-structured medical transcription text dataset. The initial step of clustering is text preprocessing, which removes noise from the text data to improve the clustering outcome. Traditional text preprocessing techniques such as stop word removal, tokenization and stemming are applied to the dataset.
The next step converts the text data into numerical format for clustering using the tf-idf (term frequency-inverse document frequency) method. K-means and P
Pagination:	xvi, 134 p.
URI:	http://hdl.handle.net/10603/522606
Appears in Departments:	Faculty of Information and Communication Engineering
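
Illustrative note on the abstract: the preprocessing and tf-idf steps it describes (tokenization, stop word removal, stemming, then conversion of text to numeric vectors) can be sketched roughly as below. This is a minimal sketch and not the thesis implementation. The stop word list, the crude suffix-stripping `stem` function (a stand-in for a real stemmer), the sample sentences, and the particular tf-idf variant (relative term frequency times log(N/df)) are all assumptions made for illustration.

```python
import math
import re
from collections import Counter

# Tiny illustrative stop word list (assumption; real lists are much longer).
STOP_WORDS = {"the", "a", "an", "of", "and", "with", "was", "is", "in", "to", "for"}


def stem(token):
    """Crude suffix stripping, standing in for a real stemming algorithm."""
    for suffix in ("ing", "ed", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Lowercase, tokenize on letters, drop stop words, then stem."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [stem(t) for t in tokens if t not in STOP_WORDS]


def tfidf(documents):
    """Return (vocabulary, vectors) with one tf-idf vector per document."""
    tokenized = [preprocess(d) for d in documents]
    n_docs = len(tokenized)

    # Document frequency: number of documents containing each term.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    vocab = sorted(doc_freq)
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid division by zero for empty documents
        vectors.append(
            [(counts[t] / total) * math.log(n_docs / doc_freq[t]) for t in vocab]
        )
    return vocab, vectors


if __name__ == "__main__":
    # Hypothetical sentences, not taken from the thesis dataset.
    docs = [
        "Patient presented with chest pain and was admitted.",
        "Patient reported knee pain after running.",
        "Follow-up visit for knee surgery.",
    ]
    vocab, vectors = tfidf(docs)
    for term, weight in zip(vocab, vectors[0]):
        print(f"{term}: {weight:.3f}")
```

The resulting vectors are the kind of numeric input a clustering algorithm such as K-means operates on. A term that occurs in every document receives weight zero under this variant, since log(N/df) is zero when df equals N.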

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	25.87 kB	Adobe PDF
02_prelim_pages.pdf		3.56 MB	Adobe PDF
03_content.pdf		16.5 kB	Adobe PDF
04_abstract.pdf		12.85 kB	Adobe PDF
05_chapter 1.pdf		118.48 kB	Adobe PDF
06_chapter 2.pdf		405.76 kB	Adobe PDF
07_chapter 3.pdf		262.58 kB	Adobe PDF
08_chapter 4.pdf		920.61 kB	Adobe PDF
09_chapter 5.pdf		1.19 MB	Adobe PDF
10_chapter 6.pdf		8.11 kB	Adobe PDF
11_annexures.pdf		195.79 kB	Adobe PDF
80_recommendation.pdf		49.28 kB	Adobe PDF

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Shodhganga : a reservoir of Indian theses @ INFLIBNET