Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/507388
Title: | Detection of Vowels in Speech Signal with Application to Children s Keyword Spotting |
Researcher: | Garnaik, Sarmila |
Guide(s): | Sethi, Kabiraj and Pradhan, Gayadhar |
Keywords: | Engineering Engineering and Technology Engineering Electrical and Electronic KWS system pitch robust spectral processing spectral smoothing temporal processing vowel endpoints Vowel onset points |
University: | Veer Surendra Sai University of Technology |
Completed Date: | 2023 |
Abstract: | The vowel onset points (VOPs) and vowel end points (VEPs) are newlinetwo major events in a speech signal. The vowels can be classified newlinefrom other non-vowels anchoring VOPs and VEPs. In most cases, newlinethe signal transition at the VEP is slow compared to the VOP, which newlinealso depends on bounding sound units. Consequently, the detection of newlineVOPs and VEPs using a single algorithm is challenging. The detection newlineof these events is also affected by ambient noise. This thesis work aims newlineat extracting scalar features from the speech signal for simultaneously newlineenhancing the detection of VOP and VEP events for the classification newlineof vowels in a running speech. Then, by classifying each analysis frame newlineas vowel or non-vowel, a non-uniform spectral smoothing approach newlineis proposed for the development of a pitch-robust keyword spotting newlinesystem.The salient contributions of this thesis work are as follows: newlinei.) A novel temporal processing approach for extracting temporal newlinedynamics from speech signal for enhancing detection accuracy of newlineVOPs and VEPs through a single algorithm. newlineii.) An approach using variational mode decomposition (VMD) for newlinesuppressing the effect of ambient noise and reconstruction of the newlinesmoothed magnitude spectra to represent the resonance structure newlineof the vocal-tract system to detect VOPs and VEPs. newlineiii.) A robust scalar feature by extracting variation of short-term newlinemagnitude spectra (ST-MS). newlineiv.) A spectral processing approach for detection of vowels following newlinethe cepstral feature extraction process, to reduce the overall computation newlinein the development of speech-based applications using newlinevowel event. newlinev.) An approach for enhancing detection of VEPs using non-linear newlinemapping. newlinevi.) A non-uniform spectral smoothing approach using knowledge of newlinevowels for development of pitch robust KWS system. |
Pagination: | |
URI: | http://hdl.handle.net/10603/507388 |
Appears in Departments: | Department of Electronics and Telecommunication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 111.5 kB | Adobe PDF | View/Open |
02_prelim pages.pdf | 162.45 kB | Adobe PDF | View/Open | |
03_content.pdf | 136.24 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 34.61 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 637.16 kB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 281.63 kB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 1.45 MB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 1.36 MB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 374.7 kB | Adobe PDF | View/Open | |
10_chapter 6.pdf | 189.22 kB | Adobe PDF | View/Open | |
11_annexures.pdf | 177.56 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 137.15 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: