Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/548352
Title: An Efficient Approach for Automatic Text Summarization of Hindi Text
Researcher: Sunil Dhankhar
Guide(s): Mukesh Kumar Gupta
Keywords: Computer Science
Computer Science Information Systems
Engineering and Technology
University: Rajasthan Technical University, Kota
Completed Date: 2023
Abstract: Due to the rise of the Internet, the number of digital documents has increased significantly. This has led to the need for a text summarization system that can automatically produce a summary of the documents. Such a system also eliminates the manual work and time spent compiling the information. A summary contains key phrases and other relevant text material without altering the source document's general context or key information. Research on text summarization started in 1958 and is still ongoing. Text summarization systems are classified as extractive or abstractive. The extractive text summarization process extracts the most appropriate sentences and phrases from documents and compiles them into a summary. In contrast, abstractive systems produce a summary using words other than the ones contained in the input document. There has been a significant amount of research on documents written in English but relatively little on documents written in Hindi. This research targets Hindi-language documents, as Hindi is India's most spoken language. From the literature, we have not found any efficient summarizing system for Hindi-language documents.
In general, to produce a summary, first extract the features from the document's text, then compute the score of each sentence in the document based on its feature values, and then select the sentences with the highest scores. The number of sentences that can be included in a summary depends on the user-defined compression ratio. This research investigated several methods used in the literature to rank each sentence of Hindi or English text documents. These are graph-based, deep learning-based, semantic-based, statistical-based, fuzzy-based, and machine learning-based methods.
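The generic extractive pipeline described in the abstract (extract features, score each sentence, keep the top-scoring sentences up to a compression ratio) can be illustrated with a minimal sketch. This is not the thesis's method: the single feature used here (average term frequency of a sentence's words), the function names, and the default ratio are all assumptions chosen for illustration only.

```python
import re
from collections import Counter

# Punctuation stripped from token edges; includes the Devanagari danda.
PUNCT = "।.,!?;:\"'()"


def split_sentences(text):
    """Split on whitespace that follows a danda or a Latin sentence terminator."""
    return [s for s in re.split(r"(?<=[।.!?])\s+", text.strip()) if s]


def tokenize(sentence):
    """Whitespace tokens with edge punctuation removed.

    Splitting on whitespace (rather than a \\w+ pattern) keeps Devanagari
    vowel signs attached to their words.
    """
    words = (w.strip(PUNCT) for w in sentence.split())
    return [w for w in words if w]


def score_sentences(sentences):
    """Assumed feature: mean document-level term frequency of a sentence's words."""
    tf = Counter(w for s in sentences for w in tokenize(s))
    scores = []
    for s in sentences:
        words = tokenize(s)
        scores.append(sum(tf[w] for w in words) / len(words) if words else 0.0)
    return scores


def summarize(text, compression_ratio=0.3):
    """Return the highest-scoring sentences, in their original order."""
    sentences = split_sentences(text)
    if not sentences:
        return []
    # The user-defined compression ratio fixes how many sentences are kept.
    keep = max(1, int(len(sentences) * compression_ratio))
    scores = score_sentences(sentences)
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    return [sentences[i] for i in sorted(ranked[:keep])]
```

For example, `summarize("राम घर गया। राम ने खाना खाया। मौसम अच्छा है।", 0.4)` keeps one of the three sentences. A realistic system would combine several features (such as those from the graph-based, statistical, or fuzzy methods the abstract lists) rather than term frequency alone.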
Pagination: 3.08 mb
URI: http://hdl.handle.net/10603/548352
Appears in Departments: Computer Engineering

Files in This Item:
File                   Description    Size       Format
80_recommendation.pdf  Attached File  190.03 kB  Adobe PDF
abstract.pdf                          75.39 kB   Adobe PDF
annexures.pdf                         311.86 kB  Adobe PDF
chapter 1.pdf                         350.33 kB  Adobe PDF
chapter 2.pdf                         541.09 kB  Adobe PDF
chapter 3.pdf                         830.71 kB  Adobe PDF
chapter 4.pdf                         448.46 kB  Adobe PDF
chapter 5.pdf                         579.36 kB  Adobe PDF
chapter 6.pdf                         7.5 kB     Adobe PDF
contents.pdf                          123.84 kB  Adobe PDF
prelim pages.pdf                      1.06 MB    Adobe PDF
title.pdf                             20.85 kB   Adobe PDF


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
