Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/343256
Title: An efficient deep Convolutional neural network approach for object detection and recognition from a video sequence using multi scale anchor box
Researcher: Garg, Dweepna
Guide(s): Kotecha, Ketan
Keywords: Computer Science
Computer Science Hardware and Architecture
Engineering and Technology
Neural networks (Computer science)
University: Parul University
Completed Date: 2020
Abstract: Deep learning is a new era of machine learning which trains computers to find the newlinestructure from a massive amount of data. Learning is described at multiple levels of newlinerepresentation. This enables us to make sense of the data consisting of text, sound, newlineand images. Many computer vision problems such as object detection, image newlineclassification, and semantic segmentation have been solved using convolutional newlineneural networks. Object detection in videos involves confirming the presence of the newlineobject in the image or video and then locating it accurately for recognition. Detecting newlineand recognizing the still object from an image has comparatively shown better newlineperformance with the use of detection frameworks like R-CNN, Fast R-CNN, Faster newlineR-CNN etc. The challenge is to detect and recognize the moving object from a static newlinecamera efficiently and accurately. In the video, modeling techniques suffer from high newlinecomputation and memory costs which may lead to a decrease in performance newlinemeasures such as accuracy and efficiency to identify the object accurately. The newlinemotive behind this work is to accurately detect and recognize the moving and still newlineobject from a video sequence using deep learning in real-time. The existing newlinealgorithms of object detection based on the deep convolution neural network worked newlinewell for large-size objects as the detection models get better results. However, those newlinemodels fail to detect the varying size of the objects that have low resolution. This is newlinebecause the features do not fully represent the essential characteristics of the objects newlinein real-time after going through the repeated convolution operations of existing newlinemodels. The proposed work improves the accuracy of detection by extracting the newlinefeatures of object at different size and scale by using a multi-scale anchor box. With newlinethe help of CNN, the deep knowledge from the dataset is extracted by giving the newlinemodel a rigorous set of training samples.
Pagination: xv,98
URI: http://hdl.handle.net/10603/343256
Appears in Departments:Department of Computer Science Engineering (CSE)

Files in This Item:
File Description SizeFormat 
01_cover page.pdfAttached File18.11 kBAdobe PDFView/Open
02_certificates.pdf423.46 kBAdobe PDFView/Open
03_preliminary pages.pdf833.4 kBAdobe PDFView/Open
04_chapter-1.pdf357.4 kBAdobe PDFView/Open
05_chapter-2.pdf870.14 kBAdobe PDFView/Open
06_chapter-3.pdf630.54 kBAdobe PDFView/Open
07_chapter-4.pdf3.54 MBAdobe PDFView/Open
08_chapter-5.pdf1.31 MBAdobe PDFView/Open
09_chapter-6.pdf148 kBAdobe PDFView/Open
80_recommendation.pdf153.18 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: