Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/522616
Title: | Improved big data privacy and security using hybrid elliptic curve cryptography with pillar K means clustering algorithm in secured map reduce layer |
Researcher: | Arogya Presskila X |
Guide(s): | Ramesh K |
Keywords: | Computer Science Computer Science Information Systems Engineering and Technology |
University: | Anna University |
Completed Date: | 2023 |
Abstract: | newline The traditional Big Data framework does not clearly state that both the structured and unstructured data sensitivity like health care, personal information data, online transaction data and so on. In addition, the concept of privacy and security of Big Data needs to be incorporated in the cluster nodes of the Map Reduce Layer in Hadoop framework, which is vulnerable and prone to be attacked. Accordingly, the threat of revealing personal data is subsequently alleviated in recent years. The proposed work Secured Map Reduce Layer (SMR) in the Big Data framework for improving the security and privacy of sensitive data through hybrid approach by integrating PSO (Particle Swarm Optimization) and Elliptic Curve Cryptographic mechanism. This proposed framework deals with, choosing the optimal private key for authentication, using Particle Swarm Optimization and protecting the data by Cryptographic mechanism of Elliptic Curve Cryptography. Particle swarm Optimization authorizes the global best fitness value to find the optimal private key with efficient authentication. The proposed methodology focuses on the storage of Big Data. In the first phase, Hadoop Distributed File System is used to store huge data in efficient ways and ensuring security and preserve privacy while data is processing in Map Reduce Layer. Efficient storage in HDFS is achieved by applying clustering for segregating the related data and group the similar data into blocks, which are stored in distributed nodes of HDFS for the ease of processing the data and preserving privacy. In this study, Semi Structured Medical Transcription text dataset is used. The initial step of clustering is text preprocessing to polish the text data to enhance the cluster outcomes by removing the noise. Traditional text pre-processing techniques like stop word removal, tokenization and stemming are applied in the dataset. The next step to convert the text data into numerical format for clustering is using tf-idf (term frequency-inverse document frequency) method. |
Pagination: | xvi, 134 p. |
URI: | http://hdl.handle.net/10603/522616 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 25.87 kB | Adobe PDF | View/Open |
02_prelim_pages.pdf | 3.56 MB | Adobe PDF | View/Open | |
03_content.pdf | 16.5 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 12.85 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 118.48 kB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 405.76 kB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 262.58 kB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 920.61 kB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 1.19 MB | Adobe PDF | View/Open | |
10_chapter 6.pdf | 8.11 kB | Adobe PDF | View/Open | |
11_annexures.pdf | 195.79 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 49.28 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: