Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/361231
Full metadata record
DC FieldValueLanguage
dc.coverage.spatialxix;170
dc.date.accessioned2022-02-10T06:07:12Z-
dc.date.available2022-02-10T06:07:12Z-
dc.identifier.urihttp://hdl.handle.net/10603/361231-
dc.description.abstractThis thesis is focussed on the implication of combining multiple biometric traits on the newlineperformance of Text-Independent Automatic Speaker Recognition. A multimodal biometric newlinesystem combines more than one biometric trait acquired from a person which could be newlinephysiological or behavioural in nature. Besides the biometric traits, the sources of information newlineincluded in a multimodal biometric system can involve the multiple sensing devices employed, newlinemultiple instances of data collection and multiple algorithms for multiple traits. In this era of newlinedigitization, mobile devices have become affordable and offer ease of use. Additionally, the newlineavailability of cutting edge sensor technology has simplified the acquisition of biometric traits newlineand the focus of all transactions has moved to a digital platform. Implementing a secure and a newlinereliable biometric system which can identify, recognize, or verify an individual within newlineminimum time while safeguarding the privacy of an individual is challenging and remains an newlinearea of research. newlineAfter conducting an exhaustive literature survey, two major conclusions were drawn. Unimodal newlinesystem have limitations such as failure to enrol, limited flexibility, reduced security to spoof attacks amongst others, making multimodal system the need of the hour. The additional biometric trait to be employed to reinforce the voice parameter for speaker recognition was the other concern to be addressed. Voice acquisition being a non-invasive and easy to acquire trait newlineis supplemented with lip movement features to add liveness detection for implementing a newlinerobust speaker recognition system. Lip movement of a speaker being an extension of sound in the process of speech production besides being unique for every individual, made this as the choice for another biometric parameter.A standard audio-video database - VidTIMIT, which is derived from the N-TIMIT corpus, is chosen for this research to explore a text -independent speaker recognition system along with another database of 72 speakers created in the college.
dc.format.extentxix;170
dc.languageEnglish
dc.relation
dc.rightsuniversity
dc.titleAutomatic Text Independent Speaker Recognition Using Voice and Lip Movement
dc.title.alternative
dc.creator.researcherNainan Sumita
dc.subject.keywordEngineering
dc.subject.keywordEngineering and Technology
dc.subject.keywordEngineering Electrical and Electronic
dc.description.note
dc.contributor.guideKulkarni Vaishali
dc.publisher.placeMumbai
dc.publisher.universityNarsee Monjee Institute of Management Studies
dc.publisher.institutionDepartment of Electronic Engineering
dc.date.registered2016
dc.date.completed2021
dc.date.awarded2021
dc.format.dimensions
dc.format.accompanyingmaterialDVD
dc.source.universityUniversity
dc.type.degreePh.D.
Appears in Departments:Department of Electronic Engineering

Files in This Item:
File Description SizeFormat 
03_certificate.pdfAttached File749.59 kBAdobe PDFView/Open
06_table of contents.pdf289.58 kBAdobe PDFView/Open
10_chapter 1.pdf780.52 kBAdobe PDFView/Open
11_chapter 2.pdf1.06 MBAdobe PDFView/Open
12_chapter 3.pdf2.43 MBAdobe PDFView/Open
13_chapter 4.pdf1.37 MBAdobe PDFView/Open
14_chapter 5.pdf781.64 kBAdobe PDFView/Open
15_chapter 6.pdf312.18 kBAdobe PDFView/Open
17_references.pdf554.89 kBAdobe PDFView/Open
80_recommendation.pdf129.41 kBAdobe PDFView/Open


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial 4.0 International (CC BY-NC 4.0).

Altmetric Badge: