Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/427468
Title: | Hybrid deep learning based assessment and resynthesis of stuttered speech |
Researcher: | Sheena Christabel Pravin |
Guide(s): | Palanivelan M |
Keywords: | Engineering and Technology Computer Science Computer Science Information Systems Speech Fluency Disorder Stutter Like Disfluencies Typical Disfluencies Speech Language Pathologists Hybrid Deep Ensemble Fluent Speech Generative Adversarial Network Automatic Speech Recognizer |
University: | Anna University |
Completed Date: | 2021 |
Abstract: | Stuttering is a speech fluency disorder with precedents of sound prolongations silent blocks repetition of syllables words and phrases that disrupt the fluent rendition of speech Disfluency is common even among healthy individuals which could sometimes be mistaken for stuttering So it is important to categorize disfluencies into Stutter Like Disfluencies SLD and Typical Disfluencies TD The SLDs are crucial stutter markers which include word medial repetitions prolongations and silent blocks while the TDs include natural disfluencies like the filled pauses whole word repetitions phrase repetitions and revisions Presently the assessment of stuttering severity levels as mild or severe is being carried out by the Speech Language Pathologists SLPs by a manual count of SLDs This procedure is cumbersome and may be prone to error Alternatively machine learning models may be used for disfluency classification However the machine learning models yield reduced accuracy of classification than the deep learning models which require huge training data for improving accuracy So this research work has proposed hybrid deep learning models that yield high accuracy of classification even when trained over a sparse speech dataset at an optimal computational time complexity Further a hybrid deep learning generative model has been proposed to resynthesize speech for fluency enhancement newline |
Pagination: | xxiii, 153p. |
URI: | http://hdl.handle.net/10603/427468 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 28.98 kB | Adobe PDF | View/Open |
02_prelim_pages.pdf | 955.87 kB | Adobe PDF | View/Open | |
03_contents.pdf | 157.81 kB | Adobe PDF | View/Open | |
04_abstracts.pdf | 65.86 kB | Adobe PDF | View/Open | |
05_chapter1.pdf | 269.54 kB | Adobe PDF | View/Open | |
06_chapter2.pdf | 525.34 kB | Adobe PDF | View/Open | |
07_chapter3.pdf | 1.27 MB | Adobe PDF | View/Open | |
08_chapter4.pdf | 891.48 kB | Adobe PDF | View/Open | |
09_chapter5.pdf | 857.72 kB | Adobe PDF | View/Open | |
10_chapter6.pdf | 659.08 kB | Adobe PDF | View/Open | |
11_annexures.pdf | 425.16 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 120.53 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: