Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/356198
Title: Application of single frequency filtering for speech and speaker specific tasks
Researcher: P Vishala
Guide(s): B Yegnanarayana
Keywords: Engineering
Engineering and Technology
Engineering Electrical and Electronic
University: International Institute of Information Technology, Hyderabad
Completed Date: 2021
Abstract: Speech produced in practical environments is affected by several sources of degradations, which include noises, reflections, reverberation and sounds from other sources such as background speech, vehicles, etc. These degradations reduce the performance of several speech-based applications, such as speech and speaker recognition systems. It is difficult to characterize these degradations, as they are not known in advance, and also they are time varying. Hence, it is necessary to explore and extract speech-specific and speaker-specific information in the collected speech signal to improve the performance of speech-based systems. The objective of the studies proposed in this thesis is to explore methods based on single frequency filtering (SFF) analysis of speech signals and artificial neural network (ANN) models for some speech-specific and speaker-specific tasks. The SFF analysis gives flexibility in the representation of speech information at different spectral and temporal resolutions. The ANN models are used for capturing the implicit features, specific to a given application. The SFF analysis can be used to provide a representation of the excitation source and the vocal tract system characteristics of the dynamic speech production mechanism. The ANN models help in extracting the discriminative and descriptive features as needed for a given task. The speech-specific tasks explored in this thesis include speech/nonspeech detection from degraded speech, discrimination of natural and synthetic speech. The speaker-specific tasks examined in this thesis include speaker separation in two speaker conversation data, mimicked voice detection and speaker verification. The speech/nonspeech detection task is addressed using approaches based on signal processing and on ANN models. The signal processing approach exploits speech-specific characteristics in the degraded signal. The results of speech/nonspeech detection are given for the speech data from TIMIT corpus, corrupted by synthetically adding noises of different types
Pagination: 
URI: http://hdl.handle.net/10603/356198
Appears in Departments:Department of Electronic and Communication Engineering

Files in This Item:
File Description SizeFormat 
80_recommendation.pdfAttached File37.35 kBAdobe PDFView/Open
certificate (2).pdf28.29 kBAdobe PDFView/Open
chapter1 (1).pdf70.05 kBAdobe PDFView/Open
chapter2 (1).pdf2.59 MBAdobe PDFView/Open
chapter3 (1).pdf864.66 kBAdobe PDFView/Open
chapter4 (1).pdf2.24 MBAdobe PDFView/Open
chapter5 (1).pdf1.74 MBAdobe PDFView/Open
chapter6 (1).pdf4.03 MBAdobe PDFView/Open
chapter7 (1).pdf1.91 MBAdobe PDFView/Open
preliminarypages (1).pdf172.61 kBAdobe PDFView/Open
publicationslist.pdf26.15 kBAdobe PDFView/Open
titlepage (1).pdf125.93 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: