A framework for non parametric naïve bayes classification using opinion mining

Raja Rajeswari S

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/477398

Full metadata record

DC Field	Value	Language
dc.coverage.spatial	A framework for non parametric naïve bayes classification using opinion mining
dc.date.accessioned	2023-04-19T14:18:38Z	-
dc.date.available	2023-04-19T14:18:38Z	-
dc.identifier.uri	http://hdl.handle.net/10603/477398	-
dc.description.abstract	Opinion mining has gained much attention with the rapid growth of social media. Polarity classification (Fersini et al. 2014) is an opinion mining task that supports decisions based on customer reviews and survey responses. Its goal is to classify a text document as positive or negative (Bijal et al. 2015 and Janardhana et al. 2015), which is achieved by applying machine learning classification methods. Generally, an opinion dataset contains a large number of features and therefore lies in a high-dimensional space. If all those features are considered, the accuracy of the classifier suffers. Therefore, the dimension of the data must be reduced before building the classifier model, by transforming the higher-dimensional data into a lower dimension that retains only the intrinsic information of the data. Reducing the dimension can improve the robustness of the classifier and reduce its time and computational complexity. To classify such a large volume of opinion data, a supervised machine learning technique, Naive Bayes - Kernel Density Estimation (NB-KDE), is proposed. The classification of opinions can be divided into four stages. The first stage involves pre-processing of the opinions. Generally, text documents contain a large volume of data in which most of the words are irrelevant to the content. Therefore, pre-processing is needed when classifying text documents. To handle the pre-processing task efficiently, this system uses the StringToWordVector filter, which has a number of parameters such as stemming, stopword removal and tokenizer. Tokenization (Muhammad et al. 2016) is the act of dividing strings into tokens such as words, symbols, and phrases. It is the conventional method of text analysis to generate a basic unit of words. In WEKA, the WordTokenizer is used as a simple tokenizer, which gives tokens as its output. Stopwords are filtered out before classifying the text. The stopwords (Janardhana et al. 20
dc.format.extent	xvii,114p.
dc.language	English
dc.relation	p.103-113
dc.rights	university
dc.title	A framework for non parametric naïve bayes classification using opinion mining
dc.title.alternative
dc.creator.researcher	Raja Rajeswari S
dc.subject.keyword	Opinion mining
dc.subject.keyword	Improved Gain Ratio
dc.subject.keyword	Kernel Density Estimation
dc.description.note
dc.contributor.guide	John Sanjeev Kumar A
dc.publisher.place	Chennai
dc.publisher.university	Anna University
dc.publisher.institution	Faculty of Science and Humanities
dc.date.registered
dc.date.completed	2022
dc.date.awarded	2022
dc.format.dimensions	21cm
dc.format.accompanyingmaterial	None
dc.source.university	University
dc.type.degree	Ph.D.
Appears in Departments:	Faculty of Science and Humanities
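
The abstract describes tokenization and stopword removal followed by a Naive Bayes classifier whose per-feature likelihoods come from kernel density estimates. The following is a minimal sketch of that idea in plain Python, not the thesis implementation (which the abstract says relies on WEKA). The stopword list, the Gaussian kernel, the fixed bandwidth of 0.5, and all function and class names are assumptions made for illustration. Stemming and the dimension reduction step are omitted.

```python
import math
import re
from collections import Counter, defaultdict

# Illustrative stopword list (an assumption, not the list used in the thesis).
STOPWORDS = {"the", "is", "a", "an", "and", "of", "to", "this", "it", "was"}


def tokenize(text):
    """Split text into lowercase word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def to_word_vector(text, vocabulary):
    """Count each vocabulary word in the text after stopword removal."""
    counts = Counter(t for t in tokenize(text) if t not in STOPWORDS)
    return [float(counts[w]) for w in vocabulary]


def gaussian_kde(samples, bandwidth):
    """Return a function estimating a 1-D density from samples (Gaussian kernel)."""
    norm = 1.0 / (len(samples) * bandwidth * math.sqrt(2.0 * math.pi))

    def density(x):
        return norm * sum(
            math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples
        )

    return density


class NaiveBayesKDE:
    """Naive Bayes with one kernel density estimate per (class, feature)."""

    def __init__(self, bandwidth=0.5):
        self.bandwidth = bandwidth  # assumed fixed; a real system would tune it

    def fit(self, X, y):
        rows_by_class = defaultdict(list)
        for row, label in zip(X, y):
            rows_by_class[label].append(row)
        self.log_priors = {
            c: math.log(len(rows) / len(y)) for c, rows in rows_by_class.items()
        }
        self.densities = {
            c: [
                gaussian_kde([r[j] for r in rows], self.bandwidth)
                for j in range(len(rows[0]))
            ]
            for c, rows in rows_by_class.items()
        }
        return self

    def predict(self, row):
        def log_score(c):
            # Naive independence assumption: sum per-feature log densities.
            # The tiny constant guards against log(0) far from all samples.
            return self.log_priors[c] + sum(
                math.log(dens(x) + 1e-300)
                for x, dens in zip(row, self.densities[c])
            )

        return max(self.log_priors, key=log_score)


# Toy usage with made-up reviews and a hand-picked two-word vocabulary.
vocabulary = ["great", "poor"]
reviews = [
    ("The battery is great", "positive"),
    ("great screen and great sound", "positive"),
    ("The battery is poor", "negative"),
    ("poor screen and poor sound", "negative"),
]
X = [to_word_vector(text, vocabulary) for text, _ in reviews]
y = [label for _, label in reviews]
model = NaiveBayesKDE().fit(X, y)
print(model.predict(to_word_vector("this phone is great", vocabulary)))
```

Replacing the usual single Gaussian per feature with a kernel density estimate is what makes the classifier non-parametric: the likelihood shape follows the training samples instead of a fixed distribution family.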

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	9.42 kB	Adobe PDF
02_prelimpages.pdf		536.67 kB	Adobe PDF
03_contents.pdf		55.74 kB	Adobe PDF
04_abstracts.pdf		9 kB	Adobe PDF
05_chapter1.pdf		228.03 kB	Adobe PDF
06_chapter2.pdf		404.54 kB	Adobe PDF
07_chapter3.pdf		344.3 kB	Adobe PDF
08_chapter4.pdf		398.04 kB	Adobe PDF
09_chapter5.pdf		288.97 kB	Adobe PDF
10_annexures.pdf		159.83 kB	Adobe PDF
80_recommendation.pdf		85.74 kB	Adobe PDF


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).


Shodhganga : a reservoir of Indian theses @ INFLIBNET