Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/4448
Title: | Detection and segmentation of touching handwritten Gurmukhi script |
Researcher: | Rajiv, Kumar |
Guide(s): | Singh, Amardeep |
Keywords: | Segmentation Natural Language Automatic Document Processing Character Recognition Optical Character Recognition (OCR) Computer Engineering |
Upload Date: | 31-Aug-2012 |
University: | Punjabi University |
Completed Date: | July, 2011 |
Abstract: | The scanned image of the text is not of any use for user, because that image is not editable. One can not make any change if required to the scanned document. This provides a food for thought for the theory of optical character recognition (OCR). OCR is nothing but character recognition of a segmented part of the scanned image. The overall OCR process consists of three major sub processes like pre processing, segmentation and then recognition. Out of these three, the segmentation process is the back bone of the overall OCR process. We can say that the segmentation process is the most significant process because if the segmentation is incorrect then we can not have the correct results; it is just like garbage in and garbage out. Therefore the segmented part of the image would be such that it should provide a close relation to the character to be recognized. Hence segmentation plays an important role in the OCR process. There are problems in segmentation process, but the degree of the problems varies from script to script, that is, the problem set for segmentation of the text written in a particular script may differ than the problem set for the text written in other scripts. The characteristics of the script, plays a significant role in deciding the segmentation points. But it is not an easy job, because segmentation is one of the complex processes. It is more difficult if the document is handwritten because in that case only few points are there which can be used to make segmentation. In the present study, we formulate an approach to segment the scanned document image. As per this approach, initially this considers the whole image as one large window. Then this large window is broken into less large windows giving lines, once the lines are identified then each window consisting of a line is used to find a word present in that line. For that purpose we used the concept of variable sized window, that is, the window whose size can be adjusted according to needs. |
Pagination: | ii, 144p. |
URI: | http://hdl.handle.net/10603/4448 |
Appears in Departments: | University College of Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 21.78 kB | Adobe PDF | View/Open |
02_certificate.pdf | 15.39 kB | Adobe PDF | View/Open | |
03_declaration.pdf | 15.38 kB | Adobe PDF | View/Open | |
04_acknowledgements.pdf | 21.26 kB | Adobe PDF | View/Open | |
05_index.pdf | 25.95 kB | Adobe PDF | View/Open | |
06_list of figures.pdf | 27.6 kB | Adobe PDF | View/Open | |
07_list of tables.pdf | 19.53 kB | Adobe PDF | View/Open | |
08_chapter 1.pdf | 194.54 kB | Adobe PDF | View/Open | |
09_chapter 2.pdf | 293.98 kB | Adobe PDF | View/Open | |
10_chapter 3.pdf | 298.43 kB | Adobe PDF | View/Open | |
11_chapter 4.pdf | 625.65 kB | Adobe PDF | View/Open | |
12_chapter 5.pdf | 73.72 kB | Adobe PDF | View/Open | |
13_chapter 6.pdf | 49.59 kB | Adobe PDF | View/Open | |
14_bibliography.pdf | 88.81 kB | Adobe PDF | View/Open | |
15_abstract.pdf | 14.58 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: