Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/426363
Full metadata record
DC FieldValueLanguage
dc.date.accessioned2022-12-17T09:51:45Z-
dc.date.available2022-12-17T09:51:45Z-
dc.identifier.urihttp://hdl.handle.net/10603/426363-
dc.description.abstractRepresentation learning is the branch of machine learning consisting of techniques that are capable of automatically discovering meaningful representations from raw data for efficient information extraction. In recent years, following the trends in other streams of machine learning, representation learning using neural networks has attracted significant interest. For example, deep representation learning in the text domain using word embeddings has shown interesting semantic properties that make them widely useful for many natural language processing applications. In the speech processing field, representation learning has been a challenging task. This thesis is focused on developing neural methods for representation learning of speech and audio signals, with the goal of improving downstream applications that rely on these representations. For representation learning, we pursue two broad directions - supervised and unsupervised. In the case of speech/audio signals, we identify two stages of representation learning that are explored. The first stage is the learning of a time-frequency representation (the equivalent of spectrogram) from the raw audio waveform. The second stage is the learning of modulation representations (filtering the time-frequency representations along the temporal domain, called rate filtering and spectral domain, called scale filtering). In the first part of the thesis, we propose representation learning methods for speech data in an unsupervised manner. Using the modulation representation learning as the goal, we explore various neural architecture for unsupervised learning...-
dc.format.extentxxiii, 107p.-
dc.languageEnglish-
dc.rightsself-
dc.titleNeural Representation Learning for Speech and Audio Signals-
dc.title.alternativeNeural Representation Learning for Speech and Audio Signals-
dc.creator.researcherAgrawal, Purvi-
dc.subject.keywordEngineering-
dc.subject.keywordEngineering and Technology-
dc.subject.keywordEngineering Electrical and Electronic-
dc.contributor.guideGanapathy, Sriram-
dc.publisher.placeBangalore-
dc.publisher.universityIndian Institute of Science Bangalore-
dc.publisher.institutionElectrical Engineering-
dc.date.completed2020-
dc.date.awarded2021-
dc.format.dimensions30cm-
dc.format.accompanyingmaterialNone-
dc.source.universityUniversity-
dc.type.degreePh.D.-
Appears in Departments:Electrical Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File122.48 kBAdobe PDFView/Open
02_preliminary pages.pdf415.49 kBAdobe PDFView/Open
03_table of contents.pdf103.34 kBAdobe PDFView/Open
04_abstract.pdf129.12 kBAdobe PDFView/Open
05_chapter 1.pdf1.54 MBAdobe PDFView/Open
06_chapter 2.pdf1.25 MBAdobe PDFView/Open
07_chapter 3.pdf2.51 MBAdobe PDFView/Open
08_annexure.pdf282.59 kBAdobe PDFView/Open
80_recommendation.pdf346.79 kBAdobe PDFView/Open


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: