Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/255794
Title: Development of Spectro temporal Features of speech
Researcher: Dr. Sumanlata Gautam
Guide(s): Dr. Latika Singh
Keywords: Engineering and Technology,Computer Science,Computer Science Software Engineering
University: The Northcap University (Formerly ITM University, Gurgaon)
Completed Date: 2018
Abstract: Speech and language development typically involves the ability to perceive the phonological structure of a language, relate this to the production of these sounds and finally produce the appropriate speech sounds for social communication. A deficiency in any of these processes could lead to language impairments notable in several neurodevelopment disorders like autism. Therefore, understanding acoustical basis of speech development can provide profound insights into the cognitive development of children. newline From a signal processing point of view, speech can be described as a time-pressure waveform. A wealth of research has shown that the auditory system acts as a frequency analyzer and the role of the frequency spectrum in encoding perceptual attributes of speech has been well established. Literature suggests that for a complete understanding of how various linguistic cues are encoded in a speech signal, it is important to adopt a spectro-temporal approach. Thus, spectro-temporal features of speech, being the phonological basis of perception in the auditory cortex, provide a relevant framework to study speech signal. newline In this thesis, a procedure to extract spectro-temporal features of speech has been provided. It is based on applying various filters on a spectrographic image to extract these features and determine their timescales. The statistics of these features of speech are also determined based on their timescales. Basically, two timescales were considered for this study; namely, long timescales of the order 100-500ms and short timescales of the order of 25-50ms. The long timescales represent supra-segmental features of speech like syllabicity, stress, tempo and rhythms. The short timescales represent segmental features like onset-rime units, phonemes, formants and their transitions. It is believed that these timescales represent different linguistic and phonological units. The proposed framework is then applied to a speech corpus consisting of 170 speech samples, where it is used to study development of speech production in normally developing children. The results show the presence of a significantly less number of features encoded at short timescales in children as compared to adults, but no significant difference is observed in long timescale features. The thesis also demonstrates the utility of this framework for studying speech impairments in children and adults with mild to moderate intellectual disabilities. It is evident from the results that many spectro-temporal features encoded at both the timescales are absent in subjects with intellectual disabilities; more prominently of shorter timescales. Finally, the differentiating power of spectro-temporal features is examined by training various classifiers using these features, which have yielded 95% success in classifying Intellectually Disabled. These results provide an initial step towards developing speech based early diagnostic tools and therapies newline newline
Pagination: 155p.
URI: http://hdl.handle.net/10603/255794
Appears in Departments:Department of CSE & IT

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File14.72 kBAdobe PDFView/Open
02_certificate.pdf444.12 kBAdobe PDFView/Open
04_content.pdf14.87 kBAdobe PDFView/Open
05_list_of_figures.pdf180.99 kBAdobe PDFView/Open
06_list_of_tables.pdf176.02 kBAdobe PDFView/Open
07_abstract.pdf177.67 kBAdobe PDFView/Open
08_chapter1.pdf303.46 kBAdobe PDFView/Open
09_chapter2.pdf727.06 kBAdobe PDFView/Open
10_chapter3.pdf190.04 kBAdobe PDFView/Open
11_chapter4.pdf1.15 MBAdobe PDFView/Open
12_chapter5.pdf1.06 MBAdobe PDFView/Open
13_chapter6.pdf999.09 kBAdobe PDFView/Open
14_chapter7.pdf337.35 kBAdobe PDFView/Open
15_appendix.pdf850.32 kBAdobe PDFView/Open
16_list_of_abbreviation.pdf179.99 kBAdobe PDFView/Open
17_reference.pdf473.27 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: