Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/423812
Full metadata record
DC FieldValueLanguage
dc.coverage.spatial
dc.date.accessioned2022-12-09T10:47:23Z-
dc.date.available2022-12-09T10:47:23Z-
dc.identifier.urihttp://hdl.handle.net/10603/423812-
dc.description.abstractIn today s information overloaded world, data has become the epicentre of the entire research. Textual data in the form of log, news papers, web documents, etc. is a key source of data analytics. Apart from textual contents, images, videos, audios generated by various handy devices are shared and downloaded by millions of users across the globe, every second. Finding similar items in such large and unstructured datasets (text and image) is indeed a challenging task. The exact match rarely has meaning in these environments; proximity or distance among the items is a preferred choice to identify similar items. In this work three similarity search approaches have been proposed: one for text documents and two for image datasets. For the textual data, a parallel similarity search approach has been proposed which uses Bloom filters for the representation of the features of the document and comparison with user s query. Query features are stored in an integer array. The proposed approach uses approximate similarity search; has been implemented on Graphics Processing Unit (GPU) with compute unified device architecture as the programming platform. Two approaches have been proposed for image dataset. Both approaches uses Content Based Image Retrieval (CBIR). First CBIR approach named as Bi-layer Content Based Image Retrieval (BiCBIR) System consists of two modules: first module extracts the features of images in terms of color, texture and shape. Second module consists of two layers: initially all images are compared with query image for shape and texture feature space and indexes of M images similar to the query image are retrieved. Next, M images retrieved from previous layer are matched with query image for shape and color feature space and finally F images similar to the query image are returned as output. Second approach, Feature wise Incremental CBIR, named as FiCBIR, uses color, texture, and shape features.
dc.format.extent130p.
dc.languageEnglish
dc.relation
dc.rightsuniversity
dc.titleEfficient Similarity Search Techniques for Textual and Non Textual Datasets
dc.title.alternative
dc.creator.researcherChauhan, Sachendra Singh
dc.subject.keywordComputer Science
dc.subject.keywordComputer Science Theory and Methods
dc.subject.keywordEngineering and Technology
dc.description.note
dc.contributor.guideBatra, Shalini
dc.publisher.placePatiala
dc.publisher.universityThapar Institute of Engineering and Technology
dc.publisher.institutionDepartment of Computer Science and Engineering
dc.date.registered
dc.date.completed2020
dc.date.awarded2020
dc.format.dimensions
dc.format.accompanyingmaterialNone
dc.source.universityUniversity
dc.type.degreePh.D.
Appears in Departments:Department of Computer Science and Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File100.42 kBAdobe PDFView/Open
02_prelim pages.pdf363.44 kBAdobe PDFView/Open
03_content.pdf72.26 kBAdobe PDFView/Open
04_abstract.pdf89.98 kBAdobe PDFView/Open
05_chapter 1.pdf1.4 MBAdobe PDFView/Open
06_chapter 2.pdf267.46 kBAdobe PDFView/Open
07_chapter 3.pdf619.74 kBAdobe PDFView/Open
08_chapter 4.pdf2.08 MBAdobe PDFView/Open
09_chapter 5.pdf401.04 kBAdobe PDFView/Open
10_chapter 6.pdf97.43 kBAdobe PDFView/Open
11_annexures.pdf155.58 kBAdobe PDFView/Open
80_recommendation.pdf131.69 kBAdobe PDFView/Open


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: