Automatic Text Independent Speaker Recognition Using Voice and Lip Movement

Nainan Sumita

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/361231

Title:	Automatic Text Independent Speaker Recognition Using Voice and Lip Movement
Researcher:	Nainan Sumita
Guide(s):	Kulkarni Vaishali
Keywords:	Engineering Engineering and Technology Engineering Electrical and Electronic
University:	Narsee Monjee Institute of Management Studies
Completed Date:	2021
Abstract:	This thesis is focussed on the implication of combining multiple biometric traits on the newlineperformance of Text-Independent Automatic Speaker Recognition. A multimodal biometric newlinesystem combines more than one biometric trait acquired from a person which could be newlinephysiological or behavioural in nature. Besides the biometric traits, the sources of information newlineincluded in a multimodal biometric system can involve the multiple sensing devices employed, newlinemultiple instances of data collection and multiple algorithms for multiple traits. In this era of newlinedigitization, mobile devices have become affordable and offer ease of use. Additionally, the newlineavailability of cutting edge sensor technology has simplified the acquisition of biometric traits newlineand the focus of all transactions has moved to a digital platform. Implementing a secure and a newlinereliable biometric system which can identify, recognize, or verify an individual within newlineminimum time while safeguarding the privacy of an individual is challenging and remains an newlinearea of research. newlineAfter conducting an exhaustive literature survey, two major conclusions were drawn. Unimodal newlinesystem have limitations such as failure to enrol, limited flexibility, reduced security to spoof attacks amongst others, making multimodal system the need of the hour. The additional biometric trait to be employed to reinforce the voice parameter for speaker recognition was the other concern to be addressed. Voice acquisition being a non-invasive and easy to acquire trait newlineis supplemented with lip movement features to add liveness detection for implementing a newlinerobust speaker recognition system. Lip movement of a speaker being an extension of sound in the process of speech production besides being unique for every individual, made this as the choice for another biometric parameter.A standard audio-video database - VidTIMIT, which is derived from the N-TIMIT corpus, is chosen for this research to explore a text -independent speaker recognition system along with another database of 72 speakers created in the college.
Pagination:	xix;170
URI:	http://hdl.handle.net/10603/361231
Appears in Departments:	Department of Electronic Engineering

Files in This Item:

File	Description	Size	Format
03_certificate.pdf	Attached File	749.59 kB	Adobe PDF	View/Open
06_table of contents.pdf		289.58 kB	Adobe PDF	View/Open
10_chapter 1.pdf		780.52 kB	Adobe PDF	View/Open
11_chapter 2.pdf		1.06 MB	Adobe PDF	View/Open
12_chapter 3.pdf		2.43 MB	Adobe PDF	View/Open
13_chapter 4.pdf		1.37 MB	Adobe PDF	View/Open
14_chapter 5.pdf		781.64 kB	Adobe PDF	View/Open
15_chapter 6.pdf		312.18 kB	Adobe PDF	View/Open
17_references.pdf		554.89 kB	Adobe PDF	View/Open
80_recommendation.pdf		129.41 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial 4.0 International (CC BY-NC 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET