Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/426672
Title: Probabilistic source filter model of speech
Researcher: Achuth Rao, M V
Guide(s): Ghosh, Prasanta Kumar
Keywords: Engineering
Engineering and Technology
Engineering Electrical and Electronic
University: Indian Institute of Science Bangalore
Completed Date: 2020
Abstract: The human respiratory system plays a crucial role in breathing and swallow ing. However, it also plays an essential role in speech production, which is unique to humans. Speech production involves expelling air from the lungs. As the air flows from the lungs to the lips, some kinetic energy gets con verted to sound. Different structures modulate the generated sound, which is finally radiated out of the lips. The speech consists of various informa tion such as linguistic content, speaker identity, emotional state, accent, etc. Apart from speech, there are various scenarios where the sound is generated in the human respiratory system. These could be due to abnor malities in the muscles, motor control unit, or the lungs, which can directly affect generated speech as well. A variety of sounds are also generated by these structures while breathing including snoring, Stridor, Dysphagia, and Cough. The source filter (SF) model of speech is one of the earlier models of speech production. It assumes that speech is a result of filtering an excita tion or source signal by a linear filter. The source and filter are assumed to be independent. Even though the SF model represents the speech pro duction mechanism, there needs to be a tractable way of estimating the excitation and the filter. The estimation of both of them given speech falls under the general category of signal deconvolution problem, and, hence, there is no unique solution. There are several variations of the source-filter model in the literature by assuming different structures on the source/filter. There are various ways to estimate the parameters of the source and the filter. The estimated parameters are used in various speech applications such as automatic speech recognition, text to speech, speech enhancement etc. Even though the SF model is a model of speech production, it is used in applications including Parkinson s Disease classification, asthma classification...
Pagination: xx, 181 p.
URI: http://hdl.handle.net/10603/426672
Appears in Departments:Electrical Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File81.17 kBAdobe PDFView/Open
02_prelim pages.pdf672.46 kBAdobe PDFView/Open
03_contents.pdf278.09 kBAdobe PDFView/Open
04_abstract.pdf127.06 kBAdobe PDFView/Open
05_chapter 1.pdf389.21 kBAdobe PDFView/Open
06_chapter 2.pdf363.36 kBAdobe PDFView/Open
07_chapter 3.pdf424.71 kBAdobe PDFView/Open
08_chapter 4.pdf3.87 MBAdobe PDFView/Open
09_chapter 5.pdf703.46 kBAdobe PDFView/Open
10_chapter 6.pdf1.01 MBAdobe PDFView/Open
11_chapter 7.pdf422.26 kBAdobe PDFView/Open
12_chapter 8.pdf1.47 MBAdobe PDFView/Open
13_chapter r 9.pdf681.32 kBAdobe PDFView/Open
14_chapter 10.pdf631.33 kBAdobe PDFView/Open
16_annexure.pdf826.65 kBAdobe PDFView/Open
80_recommendation.pdf244.57 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: