Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/597025
Title: An efficient explainable attention based image captioning system
Researcher: Revathi B S
Guide(s): Meena Kowshalya A
Keywords: Augmentation And Ranking
Encoder-Decoder Architecture
Image Captioning System
University: Anna University
Completed Date: 2024
Abstract: Recent advances in deep learning have brought significant attention to the integration of vision computing and natural language processing. Captioning is a method that enables machines to understand an image and provide a natural language explanation for it. The meaningful captions generated by the image have the ability of analyzing the state, the attributes and the relationship among these objects rather merely identifying the objects in the image. newlineCurrently, the Encoder-Decoder architecture is the most effective way to implement Image Captioning. In order to effectively predict objects, sceneries, and patterns within an image and provide captions, this research work presents an innovative Automatic Image Captioning System. newlineThe encoder and decoder architecture proposed in this research makes use of a novel Augmentation and Ranking (A-R model) mechanism. A rich featured image dataset is produced by augmentation, and a ranking system aids in choosing the top k priority terms. The Ranking LSTM assists in identifying the meaningful captions through ranks. The Image Captioning system performs more effectively due to this blending technique. Greedy and beam search are used to investigate the proposed A-R model under maximum and average pooling. newline
Pagination: xii,113p.
URI: http://hdl.handle.net/10603/597025
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File25.55 kBAdobe PDFView/Open
02_prelim_pages.pdf1.84 MBAdobe PDFView/Open
03_contents.pdf95.78 kBAdobe PDFView/Open
04_abstracts.pdf14.46 kBAdobe PDFView/Open
05_chapter1.pdf242.61 kBAdobe PDFView/Open
06_chapter2.pdf203.44 kBAdobe PDFView/Open
07_chapter3.pdf854.15 kBAdobe PDFView/Open
08_chapter4.pdf486.88 kBAdobe PDFView/Open
09_chapter5.pdf327.62 kBAdobe PDFView/Open
10_chapter6.pdf1.58 MBAdobe PDFView/Open
11_chapter7.pdf21.93 kBAdobe PDFView/Open
12_annexures.pdf127.94 kBAdobe PDFView/Open
80_recommendation.pdf59.66 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: