An Efficient Framework for Privacy Preservation for Big Data Applications

Kaur, Harmanjeet

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/423754

Title:	An Efficient Framework for Privacy Preservation for Big Data Applications
Researcher:	Kaur, Harmanjeet
Guide(s):	Kumar, Neeraj and Batra, Shalini
Keywords:	Big Data Computer Science Computer Science Theory and Methods Engineering and Technology Privacy Preservation
University:	Thapar Institute of Engineering and Technology
Completed Date:	2020
Abstract:	In the modern data-driven world, the actual advantage of big data can be realized if data is efficiently processed and knowledge extracted from it can serve as an important component in decision making. Data mining techniques have been used to discover interesting patterns and knowledge from large datasets. Providing all the data to data miners may provide good analytics, but it can also raise many security challenges since such data can be misused by malicious users. Thus, equilibrium should be maintained between data availability and data security as one needs to secure the confidentiality of sensitive data without affecting the efficiency of applications. Privacy preserving data mining techniques are used to extract useful information from data without compromising the security of sensitive information contained in it. Before performing any analysis on data set, it is anonymized by encryption techniques or by removing the personally identifiable information from data sets, such that the person whom the data refers will remain anonymous. The data sets used for the data mining purpose can be centralized owned by a single owner or it can be distributed among multiple parties having horizontal, vertical or arbitrary distribution. Usage of traditional cryptographic techniques for protecting the information leads to large computation and communication overheads especially, for large datasets. The anonymization techniques have less computation and communication overheads, but there is a risk of re-identification of anonymized dataset, since a large amount of data is available and by linking the different data sources with the anonymized dataset, the probability of re-identification of data is higher. This thesis proposes a framework for privacy preserving data mining on big data. Based on the proposed framework, two application domains have been identified.
Pagination:	xiv, 160p.
URI:	http://hdl.handle.net/10603/423754
Appears in Departments:	Department of Computer Science and Engineering

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	88.85 kB	Adobe PDF	View/Open
02_prelim pages.pdf		1.4 MB	Adobe PDF	View/Open
03_content.pdf		65.57 kB	Adobe PDF	View/Open
04_abstract.pdf		47.35 kB	Adobe PDF	View/Open
05_chapter 1.pdf		822.56 kB	Adobe PDF	View/Open
06_chapter 2.pdf		246.19 kB	Adobe PDF	View/Open
07_chapter 3.pdf		985.94 kB	Adobe PDF	View/Open
08_chapter 4.pdf		382.64 kB	Adobe PDF	View/Open
09_chapter 5.pdf		410.57 kB	Adobe PDF	View/Open
10_chapter 6.pdf		96.05 kB	Adobe PDF	View/Open
11_annexures.pdf		136.84 kB	Adobe PDF	View/Open
80_recommendation.pdf		568.5 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET