Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/310263
Full metadata record
DC Field | Value | Language
dc.coverage.spatial | Computer Science
dc.date.accessioned | 2020-12-31T11:35:17Z | -
dc.date.available | 2020-12-31T11:35:17Z | -
dc.identifier.uri | http://hdl.handle.net/10603/310263 | -
dc.description.abstract | Extract, Transform, Load (ETL) refers to a database process framework tasked with extracting, transforming, and loading data into a data warehouse. A web data extraction algorithm is proposed in which web templates are identified by clustering similar web pages together based on the feature similarity of their DOM structure. A hybrid transformation technique is proposed that employs token-wise sentence sorting along with Levenshtein distance for noise reduction. The RDBMS is replaced, as the data warehouse, with distributed fail-safe data clusters built using Hadoop-based techniques. This lifts the constraints on processing, storing, and retrieving large data structures. The developed algorithm is validated on the USPTO web site.
dc.format.extent | xvi, 116p.
dc.language | English
dc.relation | -
dc.rights | university
dc.title | Development of optimized algorithm for extract transform load process using soft computing techniques
dc.title.alternative |
dc.creator.researcher | Gupta, Gaurav
dc.subject.keyword | Extraction
dc.subject.keyword | Hadoop
dc.subject.keyword | Loading
dc.subject.keyword | Transformation
dc.subject.keyword | Web Template
dc.description.note | Bibliography 101-116p.
dc.contributor.guide | Chhabra, Indu and Kumar, Neelesh
dc.publisher.place | Chandigarh
dc.publisher.university | Panjab University
dc.publisher.institution | Department of Computer Science and Application
dc.date.registered | 2013
dc.date.completed | 2020
dc.date.awarded | 2020
dc.format.dimensions | -
dc.format.accompanyingmaterial | CD
dc.source.university | University
dc.type.degree | Ph.D.
Appears in Departments: Department of Computer Science and Application
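
The abstract in this record mentions token-wise sentence sorting combined with Levenshtein distance for noise reduction, but the record does not give the thesis's actual procedure. The following is only a minimal sketch of the general idea, under stated assumptions: the function names (`levenshtein`, `token_sort`, `token_sort_distance`), the lowercasing, and the whitespace tokenization are all hypothetical choices made for illustration and are not taken from the thesis.

```python
def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    # Keep the shorter string in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute ca with cb
            ))
        previous = current
    return previous[-1]


def token_sort(text: str) -> str:
    """Assumed normalization: lowercase, split on whitespace, sort the tokens."""
    return " ".join(sorted(text.lower().split()))


def token_sort_distance(a: str, b: str) -> int:
    """Edit distance between the token-sorted forms of two strings."""
    return levenshtein(token_sort(a), token_sort(b))


if __name__ == "__main__":
    # Word order no longer matters once the tokens are sorted.
    print(token_sort_distance("Panjab University", "University Panjab"))
    # A one-character typo still costs a single edit.
    print(token_sort_distance("Panjab University", "Punjab University"))
```

Sorting tokens before measuring edit distance makes the comparison insensitive to word order, so two records that differ only in the ordering of their words compare as identical, while small spelling differences still produce a small distance. A real pipeline would also need a threshold for deciding when two strings count as the same, which this record does not specify.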

Files in This Item:
File | Description | Size | Format
01_title.pdf | Attached File | 13.27 kB | Adobe PDF
02_certificate.pdf | | 932.37 kB | Adobe PDF
03_acknowledgement.pdf | | 12.45 kB | Adobe PDF
04_contents.pdf | | 607.67 kB | Adobe PDF
05_abstract.pdf | | 34.46 kB | Adobe PDF
06_abbreviations.pdf | | 22.16 kB | Adobe PDF
07_list_of_figures.pdf | | 22.43 kB | Adobe PDF
08_list_of_tables.pdf | | 22.41 kB | Adobe PDF
09_list_of_publications.pdf | | 14.58 kB | Adobe PDF
10_chapter1.pdf | | 814.73 kB | Adobe PDF
11_chapter2.pdf | | 865.39 kB | Adobe PDF
12_chapter3.pdf | | 1.08 MB | Adobe PDF
13_chapter4.pdf | | 1.75 MB | Adobe PDF
14_chapter5.pdf | | 1.02 MB | Adobe PDF
15_chapter6.pdf | | 645.73 kB | Adobe PDF
16_references.pdf | | 776.4 kB | Adobe PDF
80_recommendation.pdf | | 645.73 kB | Adobe PDF


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).