Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/566981
Title: | Contextual understanding on natural scene images for improved annotation using heuristics and methods |
Researcher: | Selvin Ebenezer, S |
Guide(s): | Raghuveera, T |
Keywords: | Computer Science Computer Science Information Systems Engineering and Technology image annotation techniques Natural Scene Images revolutionized computer vision |
University: | Anna University |
Completed Date: | 2023 |
Abstract: | Advancements in object or image annotation techniques have revolutionized computer vision and image analysis applications. By accurately labeling and marking objects or regions of interest within images, these advancements enable more efficient and accurate understanding of visual data in Natural Scene Images (NSI). Object detection and recognition have greatly benefited from improved annotation techniques, resulting in enhanced algorithms for identifying and classifying objects. Image annotation has played a vital role in training models for image captioning and understanding, as well as in augmented reality and virtual reality applications where virtual content needs to align seamlessly with the real world. In general, advancements in object or image annotation have greatly enhanced the accuracy and efficiency of computer vision algorithms, impacting diverse fields such as healthcare, transportation, assisting visually impaired, entertainment, and beyond. newlineFirst, this work presents an object detection scheme that utilizes the AlexNet deep learning model as its base. Different optimizers including SGDM, RMSProp, and Adam are employed during the execution of the AlexNet model, with performance evaluation conducted on the Flicker dataset. The results demonstrate that the Adam optimizer outperforms the others. In addition to object detection, this work introduces a context-based Hidden Markov Model (HIV IM) for improving the Image annotation based on heuristic attributes of objects and their inter-relationships. The HMM model enhances the understanding of the objects by generating annotation for the image that optimally matches the synonymous captions present in the dataset. newline |
Pagination: | xvi,136p. |
URI: | http://hdl.handle.net/10603/566981 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 235.33 kB | Adobe PDF | View/Open |
02_prelim pages.pdf | 5 MB | Adobe PDF | View/Open | |
03_content.pdf | 197.39 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 590.71 kB | Adobe PDF | View/Open | |
05_chapter1.pdf | 838.92 kB | Adobe PDF | View/Open | |
06_chapter2.pdf | 416.15 kB | Adobe PDF | View/Open | |
07_chapter3.pdf | 1.5 MB | Adobe PDF | View/Open | |
08_chapter4.pdf | 2.44 MB | Adobe PDF | View/Open | |
09_chapter5.pdf | 2.35 MB | Adobe PDF | View/Open | |
10_annexures.pdf | 1.96 MB | Adobe PDF | View/Open | |
80_recommendation.pdf | 54.74 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: