Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/4166
Title: | An investigation into Telugu font and character recognition |
Researcher: | Ghadiyaram, Anuradha |
Guide(s): | Agarwal, Arun |
Keywords: | Telugu Fonts Character Recognition Computer Science Computer and Information Sciences |
Upload Date: | 9-Aug-2012 |
University: | University of Hyderabad |
Completed Date: | 2009 |
Abstract: | Optical Character Recognition (OCR) for Indian scripts and for handwriting is an active area of research. OCR systems for Indian and many other oriental languages are not yet able to successfully recognize printed documents of varying scripts, quality, font styls and sizes. To address some of these issues, we have designed and developed a framework to simultaneously recognize fonts and characters from documents printed in Telugu. Telugu script is structurally complicated due to the large number of character shapes that can be formed. Specifically, we developed a system based on Rough Sets theory for recognizing Telugu fonts and characters. Using the theory of rough sets, we determine the most important features of characters from the point of view of classification. The present work achieved 94.37%, and 90.17% accuracies for character and font recognitions respectively. These are the raw recognition accuracies without doing any post processing. The present model emphasizes the role of theory of rough sets in feature selection and dimension reduction in Telugu font and character recognition. A comparative study with related earlier works and our contributions to the field of Telugu character recognition are presented. The developed approach falls into the category of a priori classification, where domain knowledge of the content is not used. No explicit local feature analysis is used in this methodology. Applicability of the proposed approach is tested on documents printed in English, which is an entirely different script from Telugu. Thus, a general framework for script-independent character and font recognitions is suggestive from the present methodology. Encouraging recognition accuracies in both the scripts (Telugu and English) are achieved with a scope for further improvement. |
Pagination: | 192p. |
URI: | http://hdl.handle.net/10603/4166 |
Appears in Departments: | Department of Computer & Information Sciences |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 45.58 kB | Adobe PDF | View/Open |
02_certificate.pdf | 52.46 kB | Adobe PDF | View/Open | |
03_declaration.pdf | 21.87 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 23.79 kB | Adobe PDF | View/Open | |
05_acknowledgements.pdf | 23.53 kB | Adobe PDF | View/Open | |
06_contents.pdf | 37.44 kB | Adobe PDF | View/Open | |
07_list of figures.pdf | 23.53 kB | Adobe PDF | View/Open | |
08_list of tables.pdf | 23.47 kB | Adobe PDF | View/Open | |
09_chapter 1.pdf | 73.95 kB | Adobe PDF | View/Open | |
10_chapter 2.pdf | 863.02 kB | Adobe PDF | View/Open | |
11_chapter 3.pdf | 465.53 kB | Adobe PDF | View/Open | |
12_chapter 4.pdf | 4.37 MB | Adobe PDF | View/Open | |
13_chapter 5.pdf | 512.15 kB | Adobe PDF | View/Open | |
14_chapter 6.pdf | 108.96 kB | Adobe PDF | View/Open | |
15_chapter 7.pdf | 7.53 MB | Adobe PDF | View/Open | |
16_references.pdf | 67.06 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: