Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/35514
Full metadata record
DC FieldValueLanguage
dc.coverage.spatialPattern based bootstrapping approaches For natural language processing of Morphologically rich languagesen_US
dc.date.accessioned2015-02-17T11:40:08Z-
dc.date.available2015-02-17T11:40:08Z-
dc.date.issued2015-02-17-
dc.identifier.urihttp://hdl.handle.net/10603/35514-
dc.description.abstractThis thesis attempts to tackle Natural Language Processing NLP tasks by exploiting the special characteristics of morphologically rich newlineLanguages In this thesis we use Tamil as an example to show how newlinecomputational approaches to such morphologically rich languages need to be newlinedifferent Our initial work used the special characteristics to build rule based newlinesystems However as is the case with most rule based systems only the newlinenatural language sentences of a specific domain could be tackled As a result newlineof our experience in building the rule based systems we were able to identify newlinethe linguistic features that could be effectively used for the NLP processing of newlinemorphologically rich languages newlineIn order to overcome the limitations of rule based approaches we newlinenext attempted to explore machine learning approaches One of the common newlinemachine learning approaches used for languages such as English, is newlinesupervised learning Supervised approaches require a large labor intensive newlineannotated and labeled corpus which is not available for resource scarce newlinelanguages such as Tamil Unsupervised approaches on the other hand take a newlinelong time to converge to a solution We first attempted an unsupervised approach newlinefor the semantic relation extraction From our experience with the unsupervised newlineapproach we found that the partially free word order characteristic of a newlinemorphologically rich language did not lend itself to fast convergence to a newlinesolution In this context we decided that semi supervised approaches that require a limited number of trained samples could be attempted newline newlineen_US
dc.format.extentxxv, 304p.en_US
dc.languageEnglishen_US
dc.relationp284-303.en_US
dc.rightsuniversityen_US
dc.titlePattern based bootstrapping approaches For natural language processing of Morphologically rich languagesen_US
dc.title.alternativeen_US
dc.creator.researcherBalaji Jen_US
dc.subject.keywordNatural Language Processingen_US
dc.description.notereference p284-303.en_US
dc.contributor.guideGeetha T Ven_US
dc.publisher.placeChennaien_US
dc.publisher.universityAnna Universityen_US
dc.publisher.institutionFaculty of Information and Communication Engineeringen_US
dc.date.registeredn.d,en_US
dc.date.completed01/10/2014en_US
dc.date.awarded30/10/2014en_US
dc.format.dimensions23cm.en_US
dc.format.accompanyingmaterialNoneen_US
dc.source.universityUniversityen_US
dc.type.degreePh.D.en_US
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File51.66 kBAdobe PDFView/Open
02_certificate.pdf1.28 MBAdobe PDFView/Open
03_abstract.pdf33.99 kBAdobe PDFView/Open
04_acknowledgement.pdf21.53 kBAdobe PDFView/Open
05_content.pdf67.8 kBAdobe PDFView/Open
06_chapter1.pdf143.67 kBAdobe PDFView/Open
07_chapter2.pdf259.05 kBAdobe PDFView/Open
08_chapter3.pdf877.54 kBAdobe PDFView/Open
09_chapter4.pdf1.03 MBAdobe PDFView/Open
10_chapter5.pdf694.99 kBAdobe PDFView/Open
11_chapter6.pdf1.5 MBAdobe PDFView/Open
12_chapter7.pdf1.83 MBAdobe PDFView/Open
13_chapter8.pdf24.56 kBAdobe PDFView/Open
14_reference.pdf94.41 kBAdobe PDFView/Open
15_publication.pdf37.24 kBAdobe PDFView/Open


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: