Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/255794
Title: | Development of Spectro temporal Features of speech |
Researcher: | Dr. Sumanlata Gautam |
Guide(s): | Dr. Latika Singh |
Keywords: | Engineering and Technology,Computer Science,Computer Science Software Engineering |
University: | The Northcap University (Formerly ITM University, Gurgaon) |
Completed Date: | 2018 |
Abstract: | Speech and language development typically involves the ability to perceive the phonological structure of a language, relate this to the production of these sounds and finally produce the appropriate speech sounds for social communication. A deficiency in any of these processes could lead to language impairments notable in several neurodevelopment disorders like autism. Therefore, understanding acoustical basis of speech development can provide profound insights into the cognitive development of children. newline From a signal processing point of view, speech can be described as a time-pressure waveform. A wealth of research has shown that the auditory system acts as a frequency analyzer and the role of the frequency spectrum in encoding perceptual attributes of speech has been well established. Literature suggests that for a complete understanding of how various linguistic cues are encoded in a speech signal, it is important to adopt a spectro-temporal approach. Thus, spectro-temporal features of speech, being the phonological basis of perception in the auditory cortex, provide a relevant framework to study speech signal. newline In this thesis, a procedure to extract spectro-temporal features of speech has been provided. It is based on applying various filters on a spectrographic image to extract these features and determine their timescales. The statistics of these features of speech are also determined based on their timescales. Basically, two timescales were considered for this study; namely, long timescales of the order 100-500ms and short timescales of the order of 25-50ms. The long timescales represent supra-segmental features of speech like syllabicity, stress, tempo and rhythms. The short timescales represent segmental features like onset-rime units, phonemes, formants and their transitions. It is believed that these timescales represent different linguistic and phonological units. The proposed framework is then applied to a speech corpus consisting of 170 speech samples, where it is used to study development of speech production in normally developing children. The results show the presence of a significantly less number of features encoded at short timescales in children as compared to adults, but no significant difference is observed in long timescale features. The thesis also demonstrates the utility of this framework for studying speech impairments in children and adults with mild to moderate intellectual disabilities. It is evident from the results that many spectro-temporal features encoded at both the timescales are absent in subjects with intellectual disabilities; more prominently of shorter timescales. Finally, the differentiating power of spectro-temporal features is examined by training various classifiers using these features, which have yielded 95% success in classifying Intellectually Disabled. These results provide an initial step towards developing speech based early diagnostic tools and therapies newline newline |
Pagination: | 155p. |
URI: | http://hdl.handle.net/10603/255794 |
Appears in Departments: | Department of CSE & IT |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 14.72 kB | Adobe PDF | View/Open |
02_certificate.pdf | 444.12 kB | Adobe PDF | View/Open | |
04_content.pdf | 14.87 kB | Adobe PDF | View/Open | |
05_list_of_figures.pdf | 180.99 kB | Adobe PDF | View/Open | |
06_list_of_tables.pdf | 176.02 kB | Adobe PDF | View/Open | |
07_abstract.pdf | 177.67 kB | Adobe PDF | View/Open | |
08_chapter1.pdf | 303.46 kB | Adobe PDF | View/Open | |
09_chapter2.pdf | 727.06 kB | Adobe PDF | View/Open | |
10_chapter3.pdf | 190.04 kB | Adobe PDF | View/Open | |
11_chapter4.pdf | 1.15 MB | Adobe PDF | View/Open | |
12_chapter5.pdf | 1.06 MB | Adobe PDF | View/Open | |
13_chapter6.pdf | 999.09 kB | Adobe PDF | View/Open | |
14_chapter7.pdf | 337.35 kB | Adobe PDF | View/Open | |
15_appendix.pdf | 850.32 kB | Adobe PDF | View/Open | |
16_list_of_abbreviation.pdf | 179.99 kB | Adobe PDF | View/Open | |
17_reference.pdf | 473.27 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: