Tamil text extraction using dual tree complex wavelet transform and a weighted hybrid thresholding approach

Deepa S T

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/16677

Title:	Tamil text extraction using dual tree complex wavelet transform and a weighted hybrid thresholding approach
Researcher:	Deepa S T
Guide(s):	Victor S P
Keywords:	Tamil Text Computer Science Dual tree complex
Upload Date:	4-Mar-2014
University:	Mother Teresa Womens University
Completed Date:	01/10/2013
Abstract:	In recent times, Extraction of text and caption from images and videos is important and in great demand for video retrieval, annotation,indexing and content analysis. A challenging problem for character recognition applications is text extraction from images, due to language characteristic, complex background and unknown text color. The application areas are vast. There are numerous text extraction techniques existing for extracting text from the images. In spite of availability of some really good text extraction algorithms, there is still a need to develop more efficient ones. The present work describes a novel and an efficient text extraction scheme for text extraction using Dual Tree Complex Wavelet Transform and a Weighted hybrid Thresholding Approach for text binarization. The proposed text extraction scheme is designed in such a way as to preserve the finer details in the text. The theory and algorithm for extracting the text from images are established in detail. The methodology developed along with all the algorithms are presented and documented. It is demonstrated that the proposed method achieved reasonable accuracy of the text extraction for moderately difficult examples. The present work considers the Tamil Text extraction from images. Different level of Dual Tree Complex Wavelet Transform is used for Tamil Text extraction. The text from the input images is extracted. The text can be either scene text or superimposed text. Superimposed text is the text which embedded in the image after the image is captured. The Dual Tree wavelet Transform is used to extract the text from the images and a weighted hybrid thresholding method is used for text binarization. In text extraction, the text is extracted along with the background. The background is removed using text binarization. Text extraction involves two steps, filtering and morphological dilation.
Pagination:	133p.
URI:	http://hdl.handle.net/10603/16677
Appears in Departments:	Department of Computer Science

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	28.34 kB	Adobe PDF	View/Open
02_certificate.pdf		8.58 kB	Adobe PDF	View/Open
03_abstract.pdf		17.06 kB	Adobe PDF	View/Open
04_acknowledgement.pdf		8.52 kB	Adobe PDF	View/Open
05_content.pdf		18.96 kB	Adobe PDF	View/Open
06_list_of_figures.pdf		13.71 kB	Adobe PDF	View/Open
07_chapter 1.pdf		188.19 kB	Adobe PDF	View/Open
08_chapter 2.pdf		67.25 kB	Adobe PDF	View/Open
09_chapter 3.pdf		298.38 kB	Adobe PDF	View/Open
10_chapter 4.pdf		224.34 kB	Adobe PDF	View/Open
11_chapter 5.pdf		188.88 kB	Adobe PDF	View/Open
12_conclusion.pdf		13.91 kB	Adobe PDF	View/Open
13_bibliography.pdf		33.47 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET