Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/308474
Title: Learning Representations for Word Images
Researcher: Krishnan Praveen
Guide(s): Jawahar C.V.
Keywords: Computer Science; Computer Science Information Systems; Engineering and Technology
University: International Institute of Information Technology, Hyderabad
Completed Date: 2020
Abstract: Representation learning has been a key area of investigation in pattern recognition. The primary goal of this thesis is to learn efficient representations for word images from scanned document images. An ideal representation should be invariant to multiple fonts and handwriting styles, and less sensitive to noise and degradations. In this work, we choose the paradigm of learning from data using deep neural networks.

The first contribution of this thesis is a simple technique to generate large amounts of synthetic data, useful for pre-training deep neural networks. This led to the creation of the IIIT-HWS dataset, which is now widely used in the document community. The other major contributions of this thesis are: (a) the design of a deep convolutional architecture (named HWNet) for learning an efficient holistic representation for word images, (b) a joint embedding scheme to project word images and textual strings onto a common subspace, and (c) a novel form of word image representation which respects the word form along with its semantic meaning. The learned representations are evaluated on the tasks of word spotting and word recognition. We report state-of-the-art performance on popular datasets covering both modern and historical, and both handwritten and printed, document images while keeping the representation size compact. Finally, in order to validate the proposed representations, we present some interesting use cases such as (i) finding similarity between a pair of handwritten document images, (ii) searching for keywords in online lecture videos, and (iii) building a word retrieval system for Indic scripts.
URI: http://hdl.handle.net/10603/308474
Appears in Departments: Computer Science and Engineering

Files in This Item:
File                    Description    Size       Format
80_recommendation.pdf   Attached File  202.94 kB  Adobe PDF
certificate.pdf                        44.19 kB   Adobe PDF
chapter1.pdf                           2.96 MB    Adobe PDF
chapter2.pdf                           7.08 MB    Adobe PDF
chapter3.pdf                           4.28 MB    Adobe PDF
chapter4.pdf                           3.68 MB    Adobe PDF
chapter5.pdf                           3.28 MB    Adobe PDF
chapter6.pdf                           8.85 MB    Adobe PDF
preliminary_pages.pdf                  876.05 kB  Adobe PDF
title.pdf                              75.69 kB   Adobe PDF

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
