Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/338681
Full metadata record
DC FieldValueLanguage
dc.coverage.spatialOcr framework for offline tamil handwritten scripts
dc.date.accessioned2021-09-02T04:19:14Z-
dc.date.available2021-09-02T04:19:14Z-
dc.identifier.urihttp://hdl.handle.net/10603/338681-
dc.description.abstractIn Pattern Recognition, Optical Character Recognition (OCR) is one of the popular and challenging research areas. OCR is a technology by which machine understandable code can be obtained from the character image. The applications of OCR include handwritten and printed ancient documents reading, such as handwritten characters in bank cheque, recognizing the characters from palm scripts, identifying the characters from inscriptions, and so on. This thesis focuses on recognizing the Offline handwritten characters of the cursive scripts collected from people written in the Tamil language. Even though Tamil is a South Indian language, it is also a highly speaking language in Canada, Singapore, Malaysia, SriLanka, and so on. The character sets of Tamil language consists of 12 vowels, 18 consonants, and 216 vowelized consonants and special characters. It has unique character sets to represent Tamil numbers, years, and months. The general structure of the Tamil language is versatile and cursive in nature. Numerous complexities are observed from the handwritten shapes of the Tamil Characterand#8223;s structure. They are, similar shapes introduced by writers, minor variations for the same character, characterand#8223;s structure connectivity, unnecessary curves, touching letters, shape variations, unwanted loops, angle variations leading to shape variations, unknown shaped characters similar in shape and character discontinuity etc. To provide the solution for these issues during the recognition of the versatile Tamil 247 character set, four interrelated component selection approaches are proposed in this thesis before feature extraction process. These component selections are facilitated to choose the proper characterand#8223;s structure before applying thefeature extraction algorithms. They are Zone and Junction Point-based Component Selection (ZJPCS), Junction Point-based Component Selection (JPCS), Zone-based Component Selection (ZCS), and a Junction Point Elimination based Component Selection (JPECS). To extract the required features from the character images, few test experiments are carried out with a proper analysis of various recognition works of Tamil and other language characters. The structures of the characters are connected with uneven components. By the way of human intervention, the characters are identified by the connectivity and location of the connected component. As per the feature selection in the Image Processing techniques, choosing the proper component is an essential and crucial way to recognize by a machine. Hence, it plays a major role in the proposed methodologies. For each component selection approaches, suitable feature extraction approaches are adapted to reveal the essential features from it. These features are extracted statistically (based on the pixel availability) and structurally (based on the shape properties). From these collections of feature extraction techniques attempted, four different frameworks are proposed and implemented in a hybrid way to implement a successful Tamil handwritten character recognition system. newline
dc.format.extentxxiii,217 p.
dc.languageEnglish
dc.relationp.204-216
dc.rightsuniversity
dc.titleOcr framework for offline tamil handwritten scripts
dc.title.alternative
dc.creator.researcherAntony Robert Raj, M
dc.subject.keywordTamil handwritten
dc.subject.keywordOcr
dc.subject.keywordPattern recognition
dc.description.note
dc.contributor.guideAbirami, S
dc.publisher.placeChennai
dc.publisher.universityAnna University
dc.publisher.institutionFaculty of Information and Communication Engineering
dc.date.registered
dc.date.completed2020
dc.date.awarded2020
dc.format.dimensions21cm
dc.format.accompanyingmaterialNone
dc.source.universityUniversity
dc.type.degreePh.D.
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File27.59 kBAdobe PDFView/Open
02_certificates.pdf135.5 kBAdobe PDFView/Open
03_vivaproceedings.pdf373.39 kBAdobe PDFView/Open
04_bonafidecertificate.pdf1.03 MBAdobe PDFView/Open
05_abstracts.pdf127.79 kBAdobe PDFView/Open
06_acknowledgements.pdf59.64 kBAdobe PDFView/Open
07_contents.pdf129.68 kBAdobe PDFView/Open
08_listoftables.pdf129.49 kBAdobe PDFView/Open
09_listoffigures.pdf411.72 kBAdobe PDFView/Open
10_listofabbreviations.pdf15.53 kBAdobe PDFView/Open
11_chapter1.pdf728.08 kBAdobe PDFView/Open
12_chapter2.pdf582.68 kBAdobe PDFView/Open
13_chapter3.pdf699.31 kBAdobe PDFView/Open
14_chapter4.pdf3.27 MBAdobe PDFView/Open
15_chapter5.pdf1.65 MBAdobe PDFView/Open
16_chapter6.pdf1.73 MBAdobe PDFView/Open
17_chapter7.pdf2.33 MBAdobe PDFView/Open
18_conclusion.pdf227.89 kBAdobe PDFView/Open
19_references.pdf189.04 kBAdobe PDFView/Open
20_listofpublications.pdf136.06 kBAdobe PDFView/Open
80_recommendation.pdf95.09 kBAdobe PDFView/Open


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: