Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/426672
Title: | Probabilistic source filter model of speech |
Researcher: | Achuth Rao, M V |
Guide(s): | Ghosh, Prasanta Kumar |
Keywords: | Engineering Engineering and Technology Engineering Electrical and Electronic |
University: | Indian Institute of Science Bangalore |
Completed Date: | 2020 |
Abstract: | The human respiratory system plays a crucial role in breathing and swallow ing. However, it also plays an essential role in speech production, which is unique to humans. Speech production involves expelling air from the lungs. As the air flows from the lungs to the lips, some kinetic energy gets con verted to sound. Different structures modulate the generated sound, which is finally radiated out of the lips. The speech consists of various informa tion such as linguistic content, speaker identity, emotional state, accent, etc. Apart from speech, there are various scenarios where the sound is generated in the human respiratory system. These could be due to abnor malities in the muscles, motor control unit, or the lungs, which can directly affect generated speech as well. A variety of sounds are also generated by these structures while breathing including snoring, Stridor, Dysphagia, and Cough. The source filter (SF) model of speech is one of the earlier models of speech production. It assumes that speech is a result of filtering an excita tion or source signal by a linear filter. The source and filter are assumed to be independent. Even though the SF model represents the speech pro duction mechanism, there needs to be a tractable way of estimating the excitation and the filter. The estimation of both of them given speech falls under the general category of signal deconvolution problem, and, hence, there is no unique solution. There are several variations of the source-filter model in the literature by assuming different structures on the source/filter. There are various ways to estimate the parameters of the source and the filter. The estimated parameters are used in various speech applications such as automatic speech recognition, text to speech, speech enhancement etc. Even though the SF model is a model of speech production, it is used in applications including Parkinson s Disease classification, asthma classification... |
Pagination: | xx, 181 p. |
URI: | http://hdl.handle.net/10603/426672 |
Appears in Departments: | Electrical Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 81.17 kB | Adobe PDF | View/Open |
02_prelim pages.pdf | 672.46 kB | Adobe PDF | View/Open | |
03_contents.pdf | 278.09 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 127.06 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 389.21 kB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 363.36 kB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 424.71 kB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 3.87 MB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 703.46 kB | Adobe PDF | View/Open | |
10_chapter 6.pdf | 1.01 MB | Adobe PDF | View/Open | |
11_chapter 7.pdf | 422.26 kB | Adobe PDF | View/Open | |
12_chapter 8.pdf | 1.47 MB | Adobe PDF | View/Open | |
13_chapter r 9.pdf | 681.32 kB | Adobe PDF | View/Open | |
14_chapter 10.pdf | 631.33 kB | Adobe PDF | View/Open | |
16_annexure.pdf | 826.65 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 244.57 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: