An Investigation of Videos for Crowd Analysis

Tomar, Ankit

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/480375

Title:	An Investigation of Videos for Crowd Analysis
Researcher:	Tomar, Ankit
Guide(s):	Kumar, Santosh and Pant, Bhasker
Keywords:	Computer Science Computer Science Artificial Intelligence Engineering and Technology
University:	Graphic Era University
Completed Date:	2023
Abstract:	Day-by-day exploring biological diversity caused an exponential increase in population in the last few decades that necessitated the utmost need for safety, security, and management of crowd mobility in urban planning. Manual observing crowd dynamics is a complex enough process, which often produces the wrong results. Understanding crowded sequences greatly interests computer researchers because of its significant concern for research application fields. AI scientists have observed remarkable progress for the past two decades in developing the cognitive comprehension abilities of crowds in video surveillance using computer vision techniques. Posture/light variation, occlusion, shadow effects, and perspective distortion are intrinsic visual challenges leading to crowd analysis of complex phenomena, as these constraints limit the performance of computer vision (CV) mechanisms. newlineThis thesis introduces some self-solution-based deep-learning approaches to simulate crowd behavior in video surveillance. Four research objectives have been examined to investigate crowd dynamics, which focus on density estimation-based crowd counting, the development of automated crowd video surveillance development, and a human facial recognition model with useful motion information. The significant contributions of the thesis are as follows. newlineThe thesis begins with handling the occlusion issue for pedestrian estimation in dense crowd frames and proposed a dynamic kernel convolution neural network-linear regression (DKCNN-LR) model. This model works in two phases; first, a DKCNN model employs convolutional layers in such a way that the kernel weight of each subsequent layer is half the weight of the previous layer. Among all, the first three heavy kernel load layers identify distant camera regions (low-level) features. Later, the lighter kernel weighting layers help identify near-camera region (high-level) features. Second, a linear regression model is employed to perform parametric regression between the actual count (ground truth) and th
Pagination:
URI:	http://hdl.handle.net/10603/480375
Appears in Departments:	Department of Computer Science and Engineering

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	120.31 kB	Adobe PDF	View/Open
02_prelim pages.pdf		627.71 kB	Adobe PDF	View/Open
03_contents.pdf		81.43 kB	Adobe PDF	View/Open
04_abstract.pdf		69.46 kB	Adobe PDF	View/Open
05_chapter 1.pdf		671.81 kB	Adobe PDF	View/Open
06_chapter 2.pdf		3.03 MB	Adobe PDF	View/Open
07_chapter 3.pdf		8.6 MB	Adobe PDF	View/Open
08_chapter 4.pdf		7.59 MB	Adobe PDF	View/Open
09_chapter 5.pdf		6.47 MB	Adobe PDF	View/Open
10_chapter 6.pdf		7.77 MB	Adobe PDF	View/Open
11_chapter 7.pdf		84.56 kB	Adobe PDF	View/Open
12_annextures.pdf		216.37 kB	Adobe PDF	View/Open
80_recommendation.pdf		206.69 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET