Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/397691
Title: Design of spoof speech detection system teager energy based approach
Researcher: Kamble, Madhu R.
Guide(s): Patil, Hemant A.
Keywords: Engineering and Technology
Computer Science
Computer Science Information Systems
Verification (Logic)
Speech synthesis
Computer input-output equipment
Speech processing systems
Detection
Energy conversion
Computer algorithms
University: Dhirubhai Ambani Institute of Information and Communication Technology (DA-IICT)
Completed Date: 2021
Abstract: Automatic Speaker Verification (ASV) systems are vulnerable to various spoofing attacks, namely, Speech Synthesis (SS), Voice Conversion (VC), Replay, and Impersonation. The study of spoofing countermeasures has become increasingly important and is currently a critical area of research, which is the principal objective of this thesis. With the development of Neural Network-based techniques, in particular, for machine generated spoof speech signals, the performance of Spoof Speech Detection (SSD) system will be further challenging. To encourage the development of countermeasures that are based on signal processing techniques or neural network-based features for SSD task, a standardized dataset was provided by the organizers of ASVspoof challenge campaigns during 2015, 2017, and 2019. The front-end features extracted from the speech signal has a huge impact in the field of signal processing applications. The goal of feature extraction is to estimate the meaningful information directly from the speech signal that can be helpful to the pattern classifier, speech, speaker, emotion recognition, etc. Among various spoofing attacks, speech synthesis, voice conversion, and replay attacks have been identified as the most effective and accessible forms of spoofing. Accordingly, this thesis investigates and develops a framework to extract the discriminative features to deflect these three spoofing attacks. The main contribution of the thesis is to propose various feature sets as frontend countermeasures for SSD task using a traditional Gaussian Mixture Model (GMM)-based classification system. The feature sets are based on Teager Energy Operator (TEO) and Energy Separation Algorithm (ESA), namely, Teager Energy Cepstral Coefficients (TECC), Energy Separation Algorithm Instantaneous Frequency Cepstral Coefficients (ESA-IFCC), Energy Separation Algorithm Instantaneous Amplitude Cepstral Coefficients (ESA-IACC), Amplitude Weighted Frequency Cepstral Coefficients (AWFCC), Gabor Teager Filterbank (GTFB). The motivation behind...
Pagination: xxxvii, 226 p.
URI: http://hdl.handle.net/10603/397691
Appears in Departments:Department of Information and Communication Technology

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File84.38 kBAdobe PDFView/Open
02_declaration and certificate.pdf201.16 kBAdobe PDFView/Open
03_acknowledgments.pdf63.58 kBAdobe PDFView/Open
04_contents.pdf100.37 kBAdobe PDFView/Open
05_abstract.pdf85.26 kBAdobe PDFView/Open
06_list of acronyms, symbols, tables and figures.pdf172.39 kBAdobe PDFView/Open
07_chapter 1.pdf683.88 kBAdobe PDFView/Open
08_chapter 2.pdf761.05 kBAdobe PDFView/Open
09_chapter 3.pdf2.09 MBAdobe PDFView/Open
10_chapter 4.pdf1.29 MBAdobe PDFView/Open
11_chapter 5.pdf1.3 MBAdobe PDFView/Open
12_chapter 6.pdf2.38 MBAdobe PDFView/Open
13_chapter 7.pdf2.18 MBAdobe PDFView/Open
14_chapter 8.pdf257.03 kBAdobe PDFView/Open
15_appendix.pdf253.96 kBAdobe PDFView/Open
16_bibliography.pdf144.09 kBAdobe PDFView/Open
17_list of publication and brief biography.pdf896.8 kBAdobe PDFView/Open
80_recommendation.pdf296.71 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: