An Efficient Approach for Automatic Text Summarization of Hindi Text

Sunil Dhankhar

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/548352

Title:	An Efficient Approach for Automatic Text Summarization of Hindi Text
Researcher:	Sunil Dhankhar
Guide(s):	Mukesh Kumar Gupta
Keywords:	Computer Science Computer Science Information Systems Engineering and Technology
University:	Rajasthan Technical University, Kota
Completed Date:	2023
Abstract:	Due to the rise of the Internet, the number of digital documents has increased newlinesignificantly. This has led to the need for a text summarization system that can newlineautomatically produce a summary of the documents. The text summarization system newlinealso eliminates the need for manual work and time spent compiling the information. newlineBasically, a summary contains key phrases and other relevant text material without newlinealtering the source document s general context or key information. The process of newlinetext summarization started in 1958 and is still being studied by researchers. Text newlinesummarization systems are classified as extractive or abstractive. The extractive text newlinesummarization process aims to extract the most appropriate sentences and phrases newlinefrom documents. It then compiles these into a summary. In contrast, abstractive newlinesystems produce a summary of the documents to describe words other than the ones newlinecontained in an input document. There has been a significant amount of research on newlinedocuments written in English but relatively little on documents written in Hindi. newlineThis research aims for the Hindi language documents as Hindi is India s most newlinespoken language. From the literature, we have not found any efficient summarizing newlinesystem for Hindi language documents. newlineIn general, to produce a summary, first, extract the features from the newlinedocument s text, then compute the score of each document s sentence based on newlinefeature value, and then select the sentences with the highest score. The number of newlinesentences that can be included in a summary depends on the user-defined newlinecompression ratio. This research investigated several methods used in the literature newlineto rank each sentence of Hindi or English text documents. These are graph-based, newlinedeep learning-based, sematic-based, statistical-based, fuzzy-based, and machine newlinelearning-based methods. newline
Pagination:	3.08 mb
URI:	http://hdl.handle.net/10603/548352
Appears in Departments:	Computer Engineering

Files in This Item:

File	Description	Size	Format
80_recommendation.pdf	Attached File	190.03 kB	Adobe PDF	View/Open
abstract.pdf		75.39 kB	Adobe PDF	View/Open
annexures.pdf		311.86 kB	Adobe PDF	View/Open
chapter 1.pdf		350.33 kB	Adobe PDF	View/Open
chapter 2.pdf		541.09 kB	Adobe PDF	View/Open
chapter 3.pdf		830.71 kB	Adobe PDF	View/Open
chapter 4.pdf		448.46 kB	Adobe PDF	View/Open
chapter 5.pdf		579.36 kB	Adobe PDF	View/Open
chapter 6.pdf		7.5 kB	Adobe PDF	View/Open
contents.pdf		123.84 kB	Adobe PDF	View/Open
prelim pages.pdf		1.06 MB	Adobe PDF	View/Open
title.pdf		20.85 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET