Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/16677
Title: Tamil text extraction using dual tree complex wavelet transform and a weighted hybrid thresholding approach
Researcher: Deepa S T
Guide(s): Victor S P
Keywords: Tamil Text
Computer Science
Dual tree complex
Upload Date: 4-Mar-2014
University: Mother Teresa Womens University
Completed Date: 01/10/2013
Abstract: In recent times, Extraction of text and caption from images and videos is important and in great demand for video retrieval, annotation,indexing and content analysis. A challenging problem for character recognition applications is text extraction from images, due to language characteristic, complex background and unknown text color. The application areas are vast. There are numerous text extraction techniques existing for extracting text from the images. In spite of availability of some really good text extraction algorithms, there is still a need to develop more efficient ones. The present work describes a novel and an efficient text extraction scheme for text extraction using Dual Tree Complex Wavelet Transform and a Weighted hybrid Thresholding Approach for text binarization. The proposed text extraction scheme is designed in such a way as to preserve the finer details in the text. The theory and algorithm for extracting the text from images are established in detail. The methodology developed along with all the algorithms are presented and documented. It is demonstrated that the proposed method achieved reasonable accuracy of the text extraction for moderately difficult examples. The present work considers the Tamil Text extraction from images. Different level of Dual Tree Complex Wavelet Transform is used for Tamil Text extraction. The text from the input images is extracted. The text can be either scene text or superimposed text. Superimposed text is the text which embedded in the image after the image is captured. The Dual Tree wavelet Transform is used to extract the text from the images and a weighted hybrid thresholding method is used for text binarization. In text extraction, the text is extracted along with the background. The background is removed using text binarization. Text extraction involves two steps, filtering and morphological dilation.
Pagination: 133p.
URI: http://hdl.handle.net/10603/16677
Appears in Departments:Department of Computer Science

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File28.34 kBAdobe PDFView/Open
02_certificate.pdf8.58 kBAdobe PDFView/Open
03_abstract.pdf17.06 kBAdobe PDFView/Open
04_acknowledgement.pdf8.52 kBAdobe PDFView/Open
05_content.pdf18.96 kBAdobe PDFView/Open
06_list_of_figures.pdf13.71 kBAdobe PDFView/Open
07_chapter 1.pdf188.19 kBAdobe PDFView/Open
08_chapter 2.pdf67.25 kBAdobe PDFView/Open
09_chapter 3.pdf298.38 kBAdobe PDFView/Open
10_chapter 4.pdf224.34 kBAdobe PDFView/Open
11_chapter 5.pdf188.88 kBAdobe PDFView/Open
12_conclusion.pdf13.91 kBAdobe PDFView/Open
13_bibliography.pdf33.47 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: