Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/548352
Title: | An Efficient Approach for Automatic Text Summarization of Hindi Text |
Researcher: | Sunil Dhankhar |
Guide(s): | Mukesh Kumar Gupta |
Keywords: | Computer Science Computer Science Information Systems Engineering and Technology |
University: | Rajasthan Technical University, Kota |
Completed Date: | 2023 |
Abstract: | Due to the rise of the Internet, the number of digital documents has increased newlinesignificantly. This has led to the need for a text summarization system that can newlineautomatically produce a summary of the documents. The text summarization system newlinealso eliminates the need for manual work and time spent compiling the information. newlineBasically, a summary contains key phrases and other relevant text material without newlinealtering the source document s general context or key information. The process of newlinetext summarization started in 1958 and is still being studied by researchers. Text newlinesummarization systems are classified as extractive or abstractive. The extractive text newlinesummarization process aims to extract the most appropriate sentences and phrases newlinefrom documents. It then compiles these into a summary. In contrast, abstractive newlinesystems produce a summary of the documents to describe words other than the ones newlinecontained in an input document. There has been a significant amount of research on newlinedocuments written in English but relatively little on documents written in Hindi. newlineThis research aims for the Hindi language documents as Hindi is India s most newlinespoken language. From the literature, we have not found any efficient summarizing newlinesystem for Hindi language documents. newlineIn general, to produce a summary, first, extract the features from the newlinedocument s text, then compute the score of each document s sentence based on newlinefeature value, and then select the sentences with the highest score. The number of newlinesentences that can be included in a summary depends on the user-defined newlinecompression ratio. This research investigated several methods used in the literature newlineto rank each sentence of Hindi or English text documents. These are graph-based, newlinedeep learning-based, sematic-based, statistical-based, fuzzy-based, and machine newlinelearning-based methods. newline |
Pagination: | 3.08 mb |
URI: | http://hdl.handle.net/10603/548352 |
Appears in Departments: | Computer Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
80_recommendation.pdf | Attached File | 190.03 kB | Adobe PDF | View/Open |
abstract.pdf | 75.39 kB | Adobe PDF | View/Open | |
annexures.pdf | 311.86 kB | Adobe PDF | View/Open | |
chapter 1.pdf | 350.33 kB | Adobe PDF | View/Open | |
chapter 2.pdf | 541.09 kB | Adobe PDF | View/Open | |
chapter 3.pdf | 830.71 kB | Adobe PDF | View/Open | |
chapter 4.pdf | 448.46 kB | Adobe PDF | View/Open | |
chapter 5.pdf | 579.36 kB | Adobe PDF | View/Open | |
chapter 6.pdf | 7.5 kB | Adobe PDF | View/Open | |
contents.pdf | 123.84 kB | Adobe PDF | View/Open | |
prelim pages.pdf | 1.06 MB | Adobe PDF | View/Open | |
title.pdf | 20.85 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: