Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/423754
Title: | An Efficient Framework for Privacy Preservation for Big Data Applications |
Researcher: | Kaur, Harmanjeet |
Guide(s): | Kumar, Neeraj and Batra, Shalini |
Keywords: | Big Data Computer Science Computer Science Theory and Methods Engineering and Technology Privacy Preservation |
University: | Thapar Institute of Engineering and Technology |
Completed Date: | 2020 |
Abstract: | In the modern data-driven world, the actual advantage of big data can be realized if data is efficiently processed and knowledge extracted from it can serve as an important component in decision making. Data mining techniques have been used to discover interesting patterns and knowledge from large datasets. Providing all the data to data miners may provide good analytics, but it can also raise many security challenges since such data can be misused by malicious users. Thus, equilibrium should be maintained between data availability and data security as one needs to secure the confidentiality of sensitive data without affecting the efficiency of applications. Privacy preserving data mining techniques are used to extract useful information from data without compromising the security of sensitive information contained in it. Before performing any analysis on data set, it is anonymized by encryption techniques or by removing the personally identifiable information from data sets, such that the person whom the data refers will remain anonymous. The data sets used for the data mining purpose can be centralized owned by a single owner or it can be distributed among multiple parties having horizontal, vertical or arbitrary distribution. Usage of traditional cryptographic techniques for protecting the information leads to large computation and communication overheads especially, for large datasets. The anonymization techniques have less computation and communication overheads, but there is a risk of re-identification of anonymized dataset, since a large amount of data is available and by linking the different data sources with the anonymized dataset, the probability of re-identification of data is higher. This thesis proposes a framework for privacy preserving data mining on big data. Based on the proposed framework, two application domains have been identified. |
Pagination: | xiv, 160p. |
URI: | http://hdl.handle.net/10603/423754 |
Appears in Departments: | Department of Computer Science and Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 88.85 kB | Adobe PDF | View/Open |
02_prelim pages.pdf | 1.4 MB | Adobe PDF | View/Open | |
03_content.pdf | 65.57 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 47.35 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 822.56 kB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 246.19 kB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 985.94 kB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 382.64 kB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 410.57 kB | Adobe PDF | View/Open | |
10_chapter 6.pdf | 96.05 kB | Adobe PDF | View/Open | |
11_annexures.pdf | 136.84 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 568.5 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: