Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/308474
Full metadata record
DC FieldValueLanguage
dc.coverage.spatial
dc.date.accessioned2020-12-08T11:05:50Z-
dc.date.available2020-12-08T11:05:50Z-
dc.identifier.urihttp://hdl.handle.net/10603/308474-
dc.description.abstractRepresentation learning has been a key investigation in pattern recognition. The primary goal of this thesis is to learn efficient representations for word images from scanned document images. An ideal representation should be invariant to multiple fonts, handwritten styles and less sensitive to noise and degradations. In this work, we choose the paradigm of learning from data using deep neural networks. newline newlineThe first contribution of this thesis is a simple technique to generate large amounts of synthetic data, useful for pre-training deep neural networks. This led to the creation of IIIT-HWS dataset which is now widely used in the document community. The other major contributions of this thesis are: (a) the design of a deep convolutional architecture (named as HWNet) for learning an efficient holistic representation for word images, (b) a joint embedding scheme to project words and textual strings onto a common subspace, and (c) a novel form of word image representation which respects the word form along with its semantic meaning. The learned representations are evaluated under the tasks of word spotting and word recognition. We report state-of-the-art performance on popular datasets under both modern/historical and handwritten/printed document images while keeping the representation size compact in nature. Finally, in order to validate the proposed representations of this thesis, we present some interesting use cases such as (i) finding similarity between a pair of handwritten documents images, (ii) searching for keywords from online lecture videos, and (iii) building word retrieval system for Indic scripts.
dc.format.extent
dc.languageEnglish
dc.relation
dc.rightsuniversity
dc.titleLearning Representations for Word Images
dc.title.alternative
dc.creator.researcherKrishnan Praveen
dc.subject.keywordComputer Science
dc.subject.keywordComputer Science Information Systems
dc.subject.keywordEngineering and Technology
dc.description.note
dc.contributor.guideJawahar C.V.
dc.publisher.placeHyderabad
dc.publisher.universityInternational Institute of Information Technology, Hyderabad
dc.publisher.institutionComputer Science and Engineering
dc.date.registered2012
dc.date.completed2020
dc.date.awarded2020
dc.format.dimensions
dc.format.accompanyingmaterialNone
dc.source.universityUniversity
dc.type.degreePh.D.
Appears in Departments:Computer Science and Engineering

Files in This Item:
File Description SizeFormat 
80_recommendation.pdfAttached File202.94 kBAdobe PDFView/Open
certificate.pdf44.19 kBAdobe PDFView/Open
chapter1.pdf2.96 MBAdobe PDFView/Open
chapter2.pdf7.08 MBAdobe PDFView/Open
chapter3.pdf4.28 MBAdobe PDFView/Open
chapter4.pdf3.68 MBAdobe PDFView/Open
chapter5.pdf3.28 MBAdobe PDFView/Open
chapter6.pdf8.85 MBAdobe PDFView/Open
preliminary_pages.pdf876.05 kBAdobe PDFView/Open
title.pdf75.69 kBAdobe PDFView/Open


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: