Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/578097
Title: | A Framework for Multilingual Text Refining in Web Mining |
Researcher: | Lalima Choudhary |
Guide(s): | Bhavana Narain |
Keywords: | Computer Science Computer Science Artificial Intelligence Engineering and Technology |
University: | MATS University |
Completed Date: | 2020 |
Abstract: | newline Text mining, also known as text data mining or knowledge discovery from textual databases, refers generally to the process of extracting interesting and non-trivial patterns or knowledge from unstructured text documents. It can be viewed as an extension of data mining or knowledge discovery from (structured) databases. newlineIn search, the user is typically looking for something that is already known and has been written by someone else. The problem is pushing aside all the material that currently is not relevant to your needs in order to find the relevant information. newlineIn our thesis entitled A Framework for Multilingual Text Refining in Web Mining we have developed a Novel framework in which multilingual search content text (Hindi and English) is inserted in the form data. This input data is the base of text mining which has been performed in any web search engine. This framework process multilingual text documents and produce language-independent intermediate forms. Through this framework multilingual searching in Hindi and English ispossible newlineIn the development process of our framework we have collected data from whatsapp, twitter and line. These collected data were preprocessed and index is developed. The tool used for algorithm implementation is Weka 3.2. K-mean algorithm of Clustering is used and high page ranking is our main parameter to judge the performance. Beside high page ranking we have used response time and micro precision are our parameters of judgment. We have deployed our framework namely Novel framework in the form of search engine and compared its performance with Google, msn, ask and seekport. newline |
Pagination: | 228 |
URI: | http://hdl.handle.net/10603/578097 |
Appears in Departments: | MATS School of Information Technology |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title page.pdf | Attached File | 18.32 kB | Adobe PDF | View/Open |
02_preliminary pages.pdf | 2.84 MB | Adobe PDF | View/Open | |
03_table of content.pdf | 6.92 MB | Adobe PDF | View/Open | |
04_abstract.pdf | 42.4 kB | Adobe PDF | View/Open | |
05_chapter 01.pdf | 185.47 kB | Adobe PDF | View/Open | |
05_chapter 02.pdf | 387.78 kB | Adobe PDF | View/Open | |
07_chapter 03.pdf | 37.53 kB | Adobe PDF | View/Open | |
08_chapter 04.pdf | 450.91 kB | Adobe PDF | View/Open | |
09_chapter 05.pdf | 780.9 kB | Adobe PDF | View/Open | |
10_annuexures.pdf | 704.53 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 2.82 MB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: