Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/543112
Title: Human Activity Recognition in UAV Videos using Deep Learning Techniques
Researcher: Sireesha, Gundu
Guide(s): Hussain, Syed
Keywords: human detection
human segmentation
image classification
University: Vellore Institute of Technology (VIT-AP)
Completed Date: 2023
Abstract: An Unmanned Aerial Vehicle (UAV) is an aircraft that operates without a pilot on newlineboard and possesses the ability to reach inaccessible areas. It captures high-quality im- ages or videos at a reduced cost, revolutionizing various fields such as surveillance, land surveying, media, agriculture, and emergency management. However, the increasing reliance on UAVs for surveillance and investigation calls for precise human detection and tracking. The freedom of flight presents various challenges in this regard. Over the past few years, there has been a significant transformation in vision-based human detection and tracking methodologies, moving away from traditional methods and em- bracing deep learning techniques. Deep learning enables automatic feature learning,and the rich content obtained from UAVs opens up new avenues for exploration. newlineThis thesis introduces effective methodologies for human detection and tracking in UAV videos. A groundbreaking framework for optimizing UAV videos is devel- newlineoped, specifically designed for human tracking purposes. The framework incorporates newlinekeyframe extraction, principal keyframe selection, and human path analysis techniques newlineto achieve highly optimized UAV videos with minimal path deviation. newlineThe initial research presents a neural network framework named Mask-Recurrent newlineNeural Network (Mask-RCNN), which employs advanced deep learning methods to newlineidentify and categorize individuals in aerial imagery data. Moreover, the use of His- newlinetogram Oriented Gradients (HOG) algorithm is demonstrated to localize specific re- newlinegions containing objects within the image. The study also introduces an algorithm newlinecalled histogram-based Mask-RCNN, which forms the core of the proposed framework newlineand governs its overall functionality. newlineThe second study introduces a human action recognition model for UAV videos, newlineutilizing the Inflated I3D-ConvNet (Inflated I3D) and Bidirectional Long Short-Term newlineMemory (Bi-LSTM) approaches to achieve accurate recognition of single human ac- newlinetivities. The initial module of the mod
Pagination: xiv,126
URI: http://hdl.handle.net/10603/543112
Appears in Departments:Department of Computer Science and Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File27.49 kBAdobe PDFView/Open
02_prelim pages.pdf3.7 MBAdobe PDFView/Open
03_content.pdf1.2 MBAdobe PDFView/Open
04_abstract.pdf1.54 MBAdobe PDFView/Open
05_chapter-1.pdf3.09 MBAdobe PDFView/Open
06_chapter_2.pdf11.72 MBAdobe PDFView/Open
07_chapter_3.pdf15.55 MBAdobe PDFView/Open
08_chapter_4.pdf13.45 MBAdobe PDFView/Open
09_chapter_5.pdf15.51 MBAdobe PDFView/Open
10_chapter_6.pdf14.19 MBAdobe PDFView/Open
10_references_publications.pdf12.67 MBAdobe PDFView/Open
80_recommendation.pdf6.45 MBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: