Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/423754
Title: An Efficient Framework for Privacy Preservation for Big Data Applications
Researcher: Kaur, Harmanjeet
Guide(s): Kumar, Neeraj and Batra, Shalini
Keywords: Big Data
Computer Science
Computer Science Theory and Methods
Engineering and Technology
Privacy Preservation
University: Thapar Institute of Engineering and Technology
Completed Date: 2020
Abstract: In the modern data-driven world, the actual advantage of big data can be realized if data is efficiently processed and knowledge extracted from it can serve as an important component in decision making. Data mining techniques have been used to discover interesting patterns and knowledge from large datasets. Providing all the data to data miners may provide good analytics, but it can also raise many security challenges since such data can be misused by malicious users. Thus, equilibrium should be maintained between data availability and data security as one needs to secure the confidentiality of sensitive data without affecting the efficiency of applications. Privacy preserving data mining techniques are used to extract useful information from data without compromising the security of sensitive information contained in it. Before performing any analysis on data set, it is anonymized by encryption techniques or by removing the personally identifiable information from data sets, such that the person whom the data refers will remain anonymous. The data sets used for the data mining purpose can be centralized owned by a single owner or it can be distributed among multiple parties having horizontal, vertical or arbitrary distribution. Usage of traditional cryptographic techniques for protecting the information leads to large computation and communication overheads especially, for large datasets. The anonymization techniques have less computation and communication overheads, but there is a risk of re-identification of anonymized dataset, since a large amount of data is available and by linking the different data sources with the anonymized dataset, the probability of re-identification of data is higher. This thesis proposes a framework for privacy preserving data mining on big data. Based on the proposed framework, two application domains have been identified.
Pagination: xiv, 160p.
URI: http://hdl.handle.net/10603/423754
Appears in Departments:Department of Computer Science and Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File88.85 kBAdobe PDFView/Open
02_prelim pages.pdf1.4 MBAdobe PDFView/Open
03_content.pdf65.57 kBAdobe PDFView/Open
04_abstract.pdf47.35 kBAdobe PDFView/Open
05_chapter 1.pdf822.56 kBAdobe PDFView/Open
06_chapter 2.pdf246.19 kBAdobe PDFView/Open
07_chapter 3.pdf985.94 kBAdobe PDFView/Open
08_chapter 4.pdf382.64 kBAdobe PDFView/Open
09_chapter 5.pdf410.57 kBAdobe PDFView/Open
10_chapter 6.pdf96.05 kBAdobe PDFView/Open
11_annexures.pdf136.84 kBAdobe PDFView/Open
80_recommendation.pdf568.5 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: