Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/373376
Title: Development of multimodal system for emotion recognition
Researcher: Lovejit Singh
Guide(s): Sarbjeet Singh and Aggarwal, Naveen
Keywords: Audio-video Signals
Emotion Recognition
Machine learning
Multimodal System
Peak stage selection
University: Panjab University
Completed Date: 2021
Abstract: Due to the significance of human behavioral intelligence in computing devices, this thesis focused on the facial expressions and speech of humans for their emotion recognition in multimodal signals. A comprehensive literature survey is conducted to study and analyze the existing multimodal datasets and state-of-the-art methods for emotion recognition. Considering various research issues and challenges, two main directions of research is aimed in this thesis work. In the first direction, Panjab University Multilingual Audio and Video Facial Expression (PUMAVE) dataset is proposed for multimodal emotion recognition in Indian context. The PUMAVE dataset is the first multilingual tone-sensitive audio and video facial expression dataset which provides emotional labeled clips of subjects who speak the native language (Punjabi and Hindi) and foreign language (English). The second direction of research is the analysis and development of a multimodal system for human emotion recognition through facial expressions and tone-sensitive speech. The experiment analysis is performed on various feature extraction methods, temporal feature aggregation methods and fusion strategies to determine the best approach for emotion recognition through facial expressions and tone-sensitive speech. The findings of the experimental analysis established the foundation for the development of a multimodal system through fusion of the peak stage behavior of facial expression and speech for emotion recognition in multimodal signals. The proposed multimodal system contributes a novel method for the determination of peak stage frame through analyzing the behavior of Facial Action Units (FAUs) and speech with Improved-Technique for Order of Preference by Similarity to Ideal Solution (I-TOPSIS) method. newline
Pagination: xxxiv, 322p.
URI: http://hdl.handle.net/10603/373376
Appears in Departments:University Institute of Engineering and Technology

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File27.22 kBAdobe PDFView/Open
02_certificate.pdf1.64 MBAdobe PDFView/Open
03_abstract.pdf39.56 kBAdobe PDFView/Open
04_acknowledgement.pdf24.25 kBAdobe PDFView/Open
05_contents.pdf27.8 kBAdobe PDFView/Open
06_list_of_figures.pdf51.26 kBAdobe PDFView/Open
07_list_of_tables.pdf39.73 kBAdobe PDFView/Open
08_list_of_abbreviations.pdf22.06 kBAdobe PDFView/Open
09_chapter1.pdf1.52 MBAdobe PDFView/Open
10_chapter2.pdf220.62 kBAdobe PDFView/Open
11_chapter3.pdf1.24 MBAdobe PDFView/Open
12_chapter4.pdf7.61 MBAdobe PDFView/Open
13_chapter5.pdf5.58 MBAdobe PDFView/Open
14_chapter6.pdf71.91 kBAdobe PDFView/Open
15_list_of_publications.pdf22.11 kBAdobe PDFView/Open
16_bibliography.pdf107.14 kBAdobe PDFView/Open
80_recommendation.pdf150.03 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: