Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/13437
Title: | Data transformation approaches for privacy preserving data mining |
Researcher: | Rajalaxmi R R |
Guide(s): | Natarajan, A.M. |
Keywords: | Data mining, privacy preserving, Utility Itemset, Frequent Itemset |
Upload Date: | 28-Nov-2013 |
University: | Anna University |
Abstract: | Recent advances in data mining techniques make it easier to extract hidden knowledge from large volumes of data. When organizations share data for mining, they may want to withhold confidential information and knowledge from the other organizations. Existing studies have dealt with data transformation methods that preserve privacy in clustering of numerical data, and with data sanitization approaches that hide sensitive patterns. It is essential to devise new data transformation methods that preserve privacy in clustering of categorical data. This work first addresses the protection of sensitive categorical data in clustering. Two hybrid data transformation methods have been devised to transform the sensitive categorical data, and their effectiveness in terms of privacy preservation and clustering accuracy is validated. The scaling and rotation transformation method is found to improve the privacy level, while the translation and rotation transformation method provides better clustering accuracy. Sensitive association rules are hidden by concealing the frequent itemsets. Experimental results indicate that using the item and transaction conflict ratio reduces the number of legitimate itemsets missed after sanitization. The work further focuses on sanitization approaches for privacy preservation of sensitive utility itemsets. The experimental results indicate that the item conflict degree improves results in terms of the number of legitimate itemsets lost. Privacy preservation of both utility and frequent itemsets is also considered, and two data sanitization approaches have been developed. The experimental results show that the sanitization approach based on the item conflict ratio minimizes both the non-sensitive itemsets missed and the modifications to the original database. In summary, the research devised data transformation approaches that ensure privacy while maintaining accuracy in data mining. |
Pagination: | xvi, 110 |
URI: | http://hdl.handle.net/10603/13437 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format
---|---|---|---
01_title.pdf | Attached File | 48.96 kB | Adobe PDF
02_certificates.pdf | | 884.19 kB | Adobe PDF
03_abstract.pdf | | 12.19 kB | Adobe PDF
04_acknowledgement.pdf | | 13.12 kB | Adobe PDF
05_contents.pdf | | 56.46 kB | Adobe PDF
06_chapter 1.pdf | | 40.16 kB | Adobe PDF
07_chapter 2.pdf | | 39.69 kB | Adobe PDF
08_chapter 3.pdf | | 79.15 kB | Adobe PDF
09_chapter 4.pdf | | 319.24 kB | Adobe PDF
10_chapter 5.pdf | | 296.56 kB | Adobe PDF
11_chapter 6.pdf | | 318.54 kB | Adobe PDF
12_chapter 7.pdf | | 17.4 kB | Adobe PDF
13_references.pdf | | 28.11 kB | Adobe PDF
14_publications.pdf | | 14.85 kB | Adobe PDF
15_vitae.pdf | | 11.15 kB | Adobe PDF
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).