Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/480375
Title: | An Investigation of Videos for Crowd Analysis |
Researcher: | Tomar, Ankit |
Guide(s): | Kumar, Santosh and Pant, Bhasker |
Keywords: | Computer Science Computer Science Artificial Intelligence Engineering and Technology |
University: | Graphic Era University |
Completed Date: | 2023 |
Abstract: | Day-by-day exploring biological diversity caused an exponential increase in population in the last few decades that necessitated the utmost need for safety, security, and management of crowd mobility in urban planning. Manual observing crowd dynamics is a complex enough process, which often produces the wrong results. Understanding crowded sequences greatly interests computer researchers because of its significant concern for research application fields. AI scientists have observed remarkable progress for the past two decades in developing the cognitive comprehension abilities of crowds in video surveillance using computer vision techniques. Posture/light variation, occlusion, shadow effects, and perspective distortion are intrinsic visual challenges leading to crowd analysis of complex phenomena, as these constraints limit the performance of computer vision (CV) mechanisms. newlineThis thesis introduces some self-solution-based deep-learning approaches to simulate crowd behavior in video surveillance. Four research objectives have been examined to investigate crowd dynamics, which focus on density estimation-based crowd counting, the development of automated crowd video surveillance development, and a human facial recognition model with useful motion information. The significant contributions of the thesis are as follows. newlineThe thesis begins with handling the occlusion issue for pedestrian estimation in dense crowd frames and proposed a dynamic kernel convolution neural network-linear regression (DKCNN-LR) model. This model works in two phases; first, a DKCNN model employs convolutional layers in such a way that the kernel weight of each subsequent layer is half the weight of the previous layer. Among all, the first three heavy kernel load layers identify distant camera regions (low-level) features. Later, the lighter kernel weighting layers help identify near-camera region (high-level) features. Second, a linear regression model is employed to perform parametric regression between the actual count (ground truth) and th |
Pagination: | |
URI: | http://hdl.handle.net/10603/480375 |
Appears in Departments: | Department of Computer Science and Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 120.31 kB | Adobe PDF | View/Open |
02_prelim pages.pdf | 627.71 kB | Adobe PDF | View/Open | |
03_contents.pdf | 81.43 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 69.46 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 671.81 kB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 3.03 MB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 8.6 MB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 7.59 MB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 6.47 MB | Adobe PDF | View/Open | |
10_chapter 6.pdf | 7.77 MB | Adobe PDF | View/Open | |
11_chapter 7.pdf | 84.56 kB | Adobe PDF | View/Open | |
12_annextures.pdf | 216.37 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 206.69 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: