Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/426363
Title: | Neural Representation Learning for Speech and Audio Signals |
Researcher: | Agrawal, Purvi |
Guide(s): | Ganapathy, Sriram |
Keywords: | Engineering Engineering and Technology Engineering Electrical and Electronic |
University: | Indian Institute of Science Bangalore |
Completed Date: | 2020 |
Abstract: | Representation learning is the branch of machine learning consisting of techniques that are capable of automatically discovering meaningful representations from raw data for efficient information extraction. In recent years, following the trends in other streams of machine learning, representation learning using neural networks has attracted significant interest. For example, deep representation learning in the text domain using word embeddings has shown interesting semantic properties that make them widely useful for many natural language processing applications. In the speech processing field, representation learning has been a challenging task. This thesis is focused on developing neural methods for representation learning of speech and audio signals, with the goal of improving downstream applications that rely on these representations. For representation learning, we pursue two broad directions - supervised and unsupervised. In the case of speech/audio signals, we identify two stages of representation learning that are explored. The first stage is the learning of a time-frequency representation (the equivalent of spectrogram) from the raw audio waveform. The second stage is the learning of modulation representations (filtering the time-frequency representations along the temporal domain, called rate filtering and spectral domain, called scale filtering). In the first part of the thesis, we propose representation learning methods for speech data in an unsupervised manner. Using the modulation representation learning as the goal, we explore various neural architecture for unsupervised learning... |
Pagination: | xxiii, 107p. |
URI: | http://hdl.handle.net/10603/426363 |
Appears in Departments: | Electrical Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 122.48 kB | Adobe PDF | View/Open |
02_preliminary pages.pdf | 415.49 kB | Adobe PDF | View/Open | |
03_table of contents.pdf | 103.34 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 129.12 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 1.54 MB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 1.25 MB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 2.51 MB | Adobe PDF | View/Open | |
08_annexure.pdf | 282.59 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 346.79 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: