Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/38989
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.coverage.spatial | Detection and removal of redundant And illegitimate data in data Repository an empirical analysis | en_US |
dc.date.accessioned | 2015-04-13T10:10:47Z | - |
dc.date.available | 2015-04-13T10:10:47Z | - |
dc.date.issued | 2015-04-13 | - |
dc.identifier.uri | http://hdl.handle.net/10603/38989 | - |
dc.description.abstract | Data cleansing is described as the sum of operations executed on newlineexisting data to eliminate anomalies and obtain a data collection being a newlineprecise and exclusive representation These data anomalies that contain errors discrepancies redundancies ambiguities and incompleteness hinder the newlineeffectiveness of analysis or data mining Decreasing the time and intricacies newlineof the mining process and improving the quality of datum present in the data newlinewarehouse are the important objectives of data cleansing With the intention newlineof this, the efficient technique is proposed capable of providing accurate data newlinerecords by removing the errors such as duplicate records near duplicate newlinerecords misspelling errors and illegal value errors which usually arise when newlinedata is warehoused from external sources In our proposed technique after the newlinepreprocessing steps Rabin s fingerprinting algorithm and Levenshtein newlinedistance is used for cleansing the dataset from duplicate records and nearduplicate newlinerecords respectively For correcting misspelling errors Levenshtein newlineedit distance method is utilized and the illegal value errors are identified using newlineRule Based method newline newline | en_US |
dc.format.extent | xiii, 141p. | en_US |
dc.language | English | en_US |
dc.relation | p133-140. | en_US |
dc.rights | university | en_US |
dc.title | Detection and removal of redundant And illegitimate data in data Repository an empirical analysis | en_US |
dc.title.alternative | en_US | |
dc.creator.researcher | Senthilkumar P | en_US |
dc.subject.keyword | Data cleansing | en_US |
dc.subject.keyword | Levenshtein | en_US |
dc.subject.keyword | Rabin s fingerprinting algorithm | en_US |
dc.description.note | reference p133-140. | en_US |
dc.contributor.guide | Suthanthira vanitha N | en_US |
dc.publisher.place | Chennai | en_US |
dc.publisher.university | Anna University | en_US |
dc.publisher.institution | Faculty of Information and Communication Engineering | en_US |
dc.date.registered | n.d, | en_US |
dc.date.completed | 01/08/2014 | en_US |
dc.date.awarded | 30/08/2014 | en_US |
dc.format.dimensions | 23cm. | en_US |
dc.format.accompanyingmaterial | None | en_US |
dc.source.university | University | en_US |
dc.type.degree | Ph.D. | en_US |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 31.61 kB | Adobe PDF | View/Open |
02_certificate.pdf | 130.83 kB | Adobe PDF | View/Open | |
03_abstract.pdf | 60.81 kB | Adobe PDF | View/Open | |
04_acknowledgement.pdf | 58.29 kB | Adobe PDF | View/Open | |
05_content.pdf | 254.61 kB | Adobe PDF | View/Open | |
06_chapter1.pdf | 550.31 kB | Adobe PDF | View/Open | |
07_chapter2.pdf | 1.26 MB | Adobe PDF | View/Open | |
08_chapter3.pdf | 1.35 MB | Adobe PDF | View/Open | |
09_chapter4.pdf | 4.2 MB | Adobe PDF | View/Open | |
10_chapter5.pdf | 3.11 MB | Adobe PDF | View/Open | |
11_chapter6.pdf | 1.99 MB | Adobe PDF | View/Open | |
12_chapter7.pdf | 76.22 kB | Adobe PDF | View/Open | |
13_reference.pdf | 675.4 kB | Adobe PDF | View/Open | |
14_publication.pdf | 64.07 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: