Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/458438
Title: Performance analysis of different deep learning architectures for Hand action recognition
Researcher: Rubin Bose, S
Guide(s): Sathiesh Kumar, V
Keywords: Engineering and Technology
Computer Science
Telecommunications
Deep learning
Hand action recognition
Convolution Neural Network
University: Anna University
Completed Date: 2022
Abstract: Recognizing the hand actions in an unrestrained context is a challenging computer vision task. Computational cost, rapid movement, illumination changes, self-occlusion, uncertain environment, varying viewpoint, varying hand shape, size, and high degrees of freedom (DOF) are the factors that impact the performance of the hand action recognition system. To address the above specified challenges in the area of hand action recognition two different deep Convolution Neural Network (CNN) based approaches namely, multi-stage CNN and single-stage CNN are proposed and reported in this thesis. The existing standard hand action datasets do not consider most of the complexities or challenges as quoted earlier. Hence, a hand action dataset that can be used for real-time hand action recognition is collected and named MITI-HD . All the below mentioned contributions are evaluated using two standard datasets (NUSHP-II and Senz-3D) and a custom developed dataset (MITI-HD). Each model is trained using different Stochastic Gradient Descent Optimizers (Adam, Momentum, and RMSprop). The Faster R-CNN Inception-V2 is a multi-stage CNN approach utilized to perform a real-time hand action recognition. Inception-V2 is used as a backbone feature extraction network. The proposed model using Adam optimizer produces better performance (Average Precision (AP) = 99.10%, Average Recall (AR) = 96.78%, F1-Score = 97.98%, and Prediction time = 140 ms) than the other optimizers on the MITI-HD dataset. The single-stage CNN based six different deep learning models are evaluated in relation to real-time hand action recognition. newline
Pagination: xvi.189p.
URI: http://hdl.handle.net/10603/458438
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File1.38 MBAdobe PDFView/Open
02_prelim pages.pdf2.51 MBAdobe PDFView/Open
03_content.pdf379.91 kBAdobe PDFView/Open
04_abstract.pdf134.01 kBAdobe PDFView/Open
05_chapter 1.pdf687.83 kBAdobe PDFView/Open
06_chapter 2.pdf1.2 MBAdobe PDFView/Open
07_chapter 3.pdf1.44 MBAdobe PDFView/Open
08_chapter 4.pdf3.49 MBAdobe PDFView/Open
09_chapter 5.pdf1.69 MBAdobe PDFView/Open
10_chapter 6.pdf1.75 MBAdobe PDFView/Open
11_annexures.pdf110.31 kBAdobe PDFView/Open
80_recommendation.pdf103.81 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: