Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/16677
Title: | Tamil text extraction using dual tree complex wavelet transform and a weighted hybrid thresholding approach |
Researcher: | Deepa S T |
Guide(s): | Victor S P |
Keywords: | Tamil Text Computer Science Dual tree complex |
Upload Date: | 4-Mar-2014 |
University: | Mother Teresa Womens University |
Completed Date: | 01/10/2013 |
Abstract: | In recent times, Extraction of text and caption from images and videos is important and in great demand for video retrieval, annotation,indexing and content analysis. A challenging problem for character recognition applications is text extraction from images, due to language characteristic, complex background and unknown text color. The application areas are vast. There are numerous text extraction techniques existing for extracting text from the images. In spite of availability of some really good text extraction algorithms, there is still a need to develop more efficient ones. The present work describes a novel and an efficient text extraction scheme for text extraction using Dual Tree Complex Wavelet Transform and a Weighted hybrid Thresholding Approach for text binarization. The proposed text extraction scheme is designed in such a way as to preserve the finer details in the text. The theory and algorithm for extracting the text from images are established in detail. The methodology developed along with all the algorithms are presented and documented. It is demonstrated that the proposed method achieved reasonable accuracy of the text extraction for moderately difficult examples. The present work considers the Tamil Text extraction from images. Different level of Dual Tree Complex Wavelet Transform is used for Tamil Text extraction. The text from the input images is extracted. The text can be either scene text or superimposed text. Superimposed text is the text which embedded in the image after the image is captured. The Dual Tree wavelet Transform is used to extract the text from the images and a weighted hybrid thresholding method is used for text binarization. In text extraction, the text is extracted along with the background. The background is removed using text binarization. Text extraction involves two steps, filtering and morphological dilation. |
Pagination: | 133p. |
URI: | http://hdl.handle.net/10603/16677 |
Appears in Departments: | Department of Computer Science |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 28.34 kB | Adobe PDF | View/Open |
02_certificate.pdf | 8.58 kB | Adobe PDF | View/Open | |
03_abstract.pdf | 17.06 kB | Adobe PDF | View/Open | |
04_acknowledgement.pdf | 8.52 kB | Adobe PDF | View/Open | |
05_content.pdf | 18.96 kB | Adobe PDF | View/Open | |
06_list_of_figures.pdf | 13.71 kB | Adobe PDF | View/Open | |
07_chapter 1.pdf | 188.19 kB | Adobe PDF | View/Open | |
08_chapter 2.pdf | 67.25 kB | Adobe PDF | View/Open | |
09_chapter 3.pdf | 298.38 kB | Adobe PDF | View/Open | |
10_chapter 4.pdf | 224.34 kB | Adobe PDF | View/Open | |
11_chapter 5.pdf | 188.88 kB | Adobe PDF | View/Open | |
12_conclusion.pdf | 13.91 kB | Adobe PDF | View/Open | |
13_bibliography.pdf | 33.47 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: