Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/444870
Title: Shouted, Overlapped and Competitive Speech Detection in Indian Television News Debates
Researcher: Baghel, Shikha
Guide(s): Guha, Prithwijit and Prasanna, S R Mahadeva
Keywords: Engineering
Engineering and Technology
Engineering Electrical and Electronic
University: Indian Institute of Technology Guwahati
Completed Date: 2022
Abstract: "Television (TV) news debates present expert opinions, analysis and discussions on contemporary events. These debates play a critical role in navigating public belief and understanding of socio-politically relevant topics. This encourages several agencies to analyze the TV news debate content for monitoring their influence. The availability of huge (and ever increasing) amount of news debate data calls for the necessity of automatic content analysis. TV news debates are generally argumentative in nature. Such arguments are often associated with the presence of shouted, overlapped, and competitive speech. In this context, the present thesis aims to detect these three speech categories in Indian TV news debates. The first contribution of this thesis is the development of an Indian Broadcast News Debate (IBND) corpus containing audio signals from 15 news debates (approximately 13 hours). A multi-level annotation procedure was followed to obtain the final annotations for the three targeted tasks of the thesis. The second contribution lies in the proposal of excitation source based Shouted Speech Detection (SSD). Both handcrafted and learned features from excitation source-based representations are explored for SSD. An autoencoder with Bi-GRU based architecture is used as classifier. The third aim of the thesis is to identify the overlapped speech in TV news debates. Phase-based representations of the speech signals are established as efficient features for Overlapped Speech Detection (OSD) using a CNN-LSTM based classifier. Finally, the shouted and overlapped speech classification network embeddings and their prediction scores are used as features to identify the competitive speech. It has been shown that the detection of competitive speech can be performed efficiently using high-level information of both shouted and overlapped speech."
Pagination: Not Available
URI: http://hdl.handle.net/10603/444870
Appears in Departments:DEPARTMENT OF ELECTRONICS AND ELECTRICAL ENGINEERING

Files in This Item:
File Description SizeFormat 
01_fulltext.pdfAttached File6.45 MBAdobe PDFView/Open
04_abstract.pdf689.57 kBAdobe PDFView/Open
80_recommendation.pdf225.5 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: