Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/361231
Title: | Automatic Text Independent Speaker Recognition Using Voice and Lip Movement |
Researcher: | Nainan Sumita |
Guide(s): | Kulkarni Vaishali |
Keywords: | Engineering; Engineering and Technology; Engineering Electrical and Electronic |
University: | Narsee Monjee Institute of Management Studies |
Completed Date: | 2021 |
Abstract: | This thesis focuses on how combining multiple biometric traits affects the performance of Text-Independent Automatic Speaker Recognition. A multimodal biometric system combines more than one biometric trait acquired from a person; the traits may be physiological or behavioural in nature. Besides the biometric traits themselves, the sources of information in a multimodal biometric system can include multiple sensing devices, multiple instances of data collection, and multiple algorithms for multiple traits. In this era of digitization, mobile devices have become affordable and easy to use. Additionally, the availability of cutting-edge sensor technology has simplified the acquisition of biometric traits, and the focus of transactions has moved to digital platforms. Implementing a secure and reliable biometric system that can identify, recognize, or verify an individual in minimal time while safeguarding that individual's privacy is challenging and remains an area of research.
After an exhaustive literature survey, two major conclusions were drawn. First, unimodal systems have limitations such as failure to enrol, limited flexibility, and reduced security against spoofing attacks, among others, which makes multimodal systems the need of the hour. Second, the additional biometric trait that would reinforce voice for speaker recognition had to be chosen. Voice, a non-invasive and easily acquired trait, is supplemented with lip movement features to add liveness detection and so implement a robust speaker recognition system. Lip movement was chosen as the second biometric parameter because it is an extension of sound in the process of speech production and is unique to every individual.
A standard audio-video database, VidTIMIT, which is derived from the N-TIMIT corpus, is chosen for this research to explore a text-independent speaker recognition system, along with another database of 72 speakers created in the college. |
Pagination: | xix;170 |
URI: | http://hdl.handle.net/10603/361231 |
Appears in Departments: | Department of Electronic Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
03_certificate.pdf | Attached File | 749.59 kB | Adobe PDF | View/Open |
06_table of contents.pdf | | 289.58 kB | Adobe PDF | View/Open |
10_chapter 1.pdf | | 780.52 kB | Adobe PDF | View/Open |
11_chapter 2.pdf | | 1.06 MB | Adobe PDF | View/Open |
12_chapter 3.pdf | | 2.43 MB | Adobe PDF | View/Open |
13_chapter 4.pdf | | 1.37 MB | Adobe PDF | View/Open |
14_chapter 5.pdf | | 781.64 kB | Adobe PDF | View/Open |
15_chapter 6.pdf | | 312.18 kB | Adobe PDF | View/Open |
17_references.pdf | | 554.89 kB | Adobe PDF | View/Open |
80_recommendation.pdf | | 129.41 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial 4.0 International (CC BY-NC 4.0).