Probabilistic source filter model of speech

Achuth Rao, M V

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/426672

Title:	Probabilistic source filter model of speech
Researcher:	Achuth Rao, M V
Guide(s):	Ghosh, Prasanta Kumar
Keywords:	Engineering Engineering and Technology Engineering Electrical and Electronic
University:	Indian Institute of Science Bangalore
Completed Date:	2020
Abstract:	The human respiratory system plays a crucial role in breathing and swallow ing. However, it also plays an essential role in speech production, which is unique to humans. Speech production involves expelling air from the lungs. As the air flows from the lungs to the lips, some kinetic energy gets con verted to sound. Different structures modulate the generated sound, which is finally radiated out of the lips. The speech consists of various informa tion such as linguistic content, speaker identity, emotional state, accent, etc. Apart from speech, there are various scenarios where the sound is generated in the human respiratory system. These could be due to abnor malities in the muscles, motor control unit, or the lungs, which can directly affect generated speech as well. A variety of sounds are also generated by these structures while breathing including snoring, Stridor, Dysphagia, and Cough. The source filter (SF) model of speech is one of the earlier models of speech production. It assumes that speech is a result of filtering an excita tion or source signal by a linear filter. The source and filter are assumed to be independent. Even though the SF model represents the speech pro duction mechanism, there needs to be a tractable way of estimating the excitation and the filter. The estimation of both of them given speech falls under the general category of signal deconvolution problem, and, hence, there is no unique solution. There are several variations of the source-filter model in the literature by assuming different structures on the source/filter. There are various ways to estimate the parameters of the source and the filter. The estimated parameters are used in various speech applications such as automatic speech recognition, text to speech, speech enhancement etc. Even though the SF model is a model of speech production, it is used in applications including Parkinson s Disease classification, asthma classification...
Pagination:	xx, 181 p.
URI:	http://hdl.handle.net/10603/426672
Appears in Departments:	Electrical Engineering

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	81.17 kB	Adobe PDF	View/Open
02_prelim pages.pdf		672.46 kB	Adobe PDF	View/Open
03_contents.pdf		278.09 kB	Adobe PDF	View/Open
04_abstract.pdf		127.06 kB	Adobe PDF	View/Open
05_chapter 1.pdf		389.21 kB	Adobe PDF	View/Open
06_chapter 2.pdf		363.36 kB	Adobe PDF	View/Open
07_chapter 3.pdf		424.71 kB	Adobe PDF	View/Open
08_chapter 4.pdf		3.87 MB	Adobe PDF	View/Open
09_chapter 5.pdf		703.46 kB	Adobe PDF	View/Open
10_chapter 6.pdf		1.01 MB	Adobe PDF	View/Open
11_chapter 7.pdf		422.26 kB	Adobe PDF	View/Open
12_chapter 8.pdf		1.47 MB	Adobe PDF	View/Open
13_chapter r 9.pdf		681.32 kB	Adobe PDF	View/Open
14_chapter 10.pdf		631.33 kB	Adobe PDF	View/Open
16_annexure.pdf		826.65 kB	Adobe PDF	View/Open
80_recommendation.pdf		244.57 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET