Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/38989
Title: Detection and Removal of Redundant and Illegitimate Data in Data Repository: An Empirical Analysis
Researcher: Senthilkumar P
Guide(s): Suthanthira vanitha N
Keywords: Data cleansing
Levenshtein
Rabin's fingerprinting algorithm
Upload Date: 13-Apr-2015
University: Anna University
Completed Date: 01/08/2014
Abstract: Data cleansing is the set of operations performed on existing data to eliminate anomalies and obtain a data collection that is a precise and exclusive representation. These anomalies, which include errors, discrepancies, redundancies, ambiguities, and incompleteness, hinder the effectiveness of analysis or data mining. The main objectives of data cleansing are to reduce the time and complexity of the mining process and to improve the quality of the data in the data warehouse. To this end, an efficient technique is proposed that provides accurate data records by removing errors such as duplicate records, near-duplicate records, misspelling errors, and illegal value errors, which usually arise when data is warehoused from external sources. In the proposed technique, after the preprocessing steps, Rabin's fingerprinting algorithm and Levenshtein distance are used to cleanse the dataset of duplicate records and near-duplicate records, respectively. Misspelling errors are corrected using the Levenshtein edit distance method, and illegal value errors are identified using a rule-based method.
Pagination: xiii, 141p.
URI: http://hdl.handle.net/10603/38989
Appears in Departments: Faculty of Information and Communication Engineering
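
Illustrative sketch (not taken from the thesis): the abstract names four cleansing steps, and the Python below shows one way they could fit together. Everything in it is an assumption made for illustration: the whitespace and case normalization standing in for "preprocessing", the modulus polynomial x^64 + x^4 + x^3 + x + 1, the distance threshold of 2, and the hypothetical helper names rabin_fingerprint, cleanse, correct_spelling, and illegal_fields. The thesis's actual parameters and rules may differ.

```python
# Modulus polynomial over GF(2): x^64 + x^4 + x^3 + x + 1 (illustrative choice).
POLY = (1 << 64) | 0x1B


def rabin_fingerprint(data: bytes) -> int:
    """Treat the bits of `data` as a polynomial over GF(2) and reduce it mod POLY."""
    fp = 0
    for byte in data:
        for bit in range(7, -1, -1):
            fp = (fp << 1) | ((byte >> bit) & 1)
            if fp >> 64:          # degree reached 64: subtract (XOR) the modulus
                fp ^= POLY
    return fp


def levenshtein(a: str, b: str) -> int:
    """Edit distance (insert, delete, substitute) via row-by-row dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def normalize(record: str) -> str:
    """Assumed preprocessing: lowercase and collapse runs of whitespace."""
    return " ".join(record.lower().split())


def cleanse(records, max_distance=2):
    """Drop exact duplicates by fingerprint, then near duplicates by edit distance."""
    seen, kept = set(), []
    for raw in records:
        rec = normalize(raw)
        fp = rabin_fingerprint(rec.encode("utf-8"))
        if fp in seen:            # same fingerprint: treated as an exact duplicate
            continue              # (a real system would also compare the strings)
        seen.add(fp)
        if any(levenshtein(rec, k) <= max_distance for k in kept):
            continue              # close to a record already kept: near duplicate
        kept.append(rec)
    return kept


def correct_spelling(word, vocabulary, max_distance=2):
    """Replace `word` with the closest vocabulary entry if it is close enough."""
    best = min(vocabulary, key=lambda v: levenshtein(word, v))
    return best if levenshtein(word, best) <= max_distance else word


def illegal_fields(row: dict, rules: dict) -> list:
    """Rule-based check: return the fields whose values fail their rule."""
    return [f for f, ok in rules.items() if f in row and not ok(row[f])]


if __name__ == "__main__":
    sample = ["John Smith, Chennai", "john  smith, chennai",
              "Jon Smith, Chennai", "Mary Jones, Salem"]
    print(cleanse(sample))
    print(correct_spelling("chenai", ["chennai", "salem"]))
    print(illegal_fields({"age": 150}, {"age": lambda v: 0 <= v <= 120}))
```

The pairwise near-duplicate scan is quadratic in the number of kept records, so it suits only small samples; a larger dataset would need some form of blocking or indexing before comparing records.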

Files in This Item:
File                     Description     Size        Format
01_title.pdf             Attached File   31.61 kB    Adobe PDF
02_certificate.pdf                       130.83 kB   Adobe PDF
03_abstract.pdf                          60.81 kB    Adobe PDF
04_acknowledgement.pdf                   58.29 kB    Adobe PDF
05_content.pdf                           254.61 kB   Adobe PDF
06_chapter1.pdf                          550.31 kB   Adobe PDF
07_chapter2.pdf                          1.26 MB     Adobe PDF
08_chapter3.pdf                          1.35 MB     Adobe PDF
09_chapter4.pdf                          4.2 MB      Adobe PDF
10_chapter5.pdf                          3.11 MB     Adobe PDF
11_chapter6.pdf                          1.99 MB     Adobe PDF
12_chapter7.pdf                          76.22 kB    Adobe PDF
13_reference.pdf                         675.4 kB    Adobe PDF
14_publication.pdf                       64.07 kB    Adobe PDF


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
