Detection and segmentation of touching handwritten Gurmukhi script

Rajiv, Kumar

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/4448

Title:	Detection and segmentation of touching handwritten Gurmukhi script
Researcher:	Rajiv, Kumar
Guide(s):	Singh, Amardeep
Keywords:	Segmentation Natural Language Automatic Document Processing Character Recognition Optical Character Recognition (OCR) Computer Engineering
Upload Date:	31-Aug-2012
University:	Punjabi University
Completed Date:	July, 2011
Abstract:	The scanned image of the text is not of any use for user, because that image is not editable. One can not make any change if required to the scanned document. This provides a food for thought for the theory of optical character recognition (OCR). OCR is nothing but character recognition of a segmented part of the scanned image. The overall OCR process consists of three major sub processes like pre processing, segmentation and then recognition. Out of these three, the segmentation process is the back bone of the overall OCR process. We can say that the segmentation process is the most significant process because if the segmentation is incorrect then we can not have the correct results; it is just like garbage in and garbage out. Therefore the segmented part of the image would be such that it should provide a close relation to the character to be recognized. Hence segmentation plays an important role in the OCR process. There are problems in segmentation process, but the degree of the problems varies from script to script, that is, the problem set for segmentation of the text written in a particular script may differ than the problem set for the text written in other scripts. The characteristics of the script, plays a significant role in deciding the segmentation points. But it is not an easy job, because segmentation is one of the complex processes. It is more difficult if the document is handwritten because in that case only few points are there which can be used to make segmentation. In the present study, we formulate an approach to segment the scanned document image. As per this approach, initially this considers the whole image as one large window. Then this large window is broken into less large windows giving lines, once the lines are identified then each window consisting of a line is used to find a word present in that line. For that purpose we used the concept of variable sized window, that is, the window whose size can be adjusted according to needs.
Pagination:	ii, 144p.
URI:	http://hdl.handle.net/10603/4448
Appears in Departments:	University College of Engineering

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	21.78 kB	Adobe PDF	View/Open
02_certificate.pdf		15.39 kB	Adobe PDF	View/Open
03_declaration.pdf		15.38 kB	Adobe PDF	View/Open
04_acknowledgements.pdf		21.26 kB	Adobe PDF	View/Open
05_index.pdf		25.95 kB	Adobe PDF	View/Open
06_list of figures.pdf		27.6 kB	Adobe PDF	View/Open
07_list of tables.pdf		19.53 kB	Adobe PDF	View/Open
08_chapter 1.pdf		194.54 kB	Adobe PDF	View/Open
09_chapter 2.pdf		293.98 kB	Adobe PDF	View/Open
10_chapter 3.pdf		298.43 kB	Adobe PDF	View/Open
11_chapter 4.pdf		625.65 kB	Adobe PDF	View/Open
12_chapter 5.pdf		73.72 kB	Adobe PDF	View/Open
13_chapter 6.pdf		49.59 kB	Adobe PDF	View/Open
14_bibliography.pdf		88.81 kB	Adobe PDF	View/Open
15_abstract.pdf		14.58 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET