Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/597025
Title: | An efficient explainable attention based image captioning system |
Researcher: | Revathi B S |
Guide(s): | Meena Kowshalya A |
Keywords: | Augmentation And Ranking Encoder-Decoder Architecture Image Captioning System |
University: | Anna University |
Completed Date: | 2024 |
Abstract: | Recent advances in deep learning have brought significant attention to the integration of vision computing and natural language processing. Captioning is a method that enables machines to understand an image and provide a natural language explanation for it. The meaningful captions generated by the image have the ability of analyzing the state, the attributes and the relationship among these objects rather merely identifying the objects in the image. newlineCurrently, the Encoder-Decoder architecture is the most effective way to implement Image Captioning. In order to effectively predict objects, sceneries, and patterns within an image and provide captions, this research work presents an innovative Automatic Image Captioning System. newlineThe encoder and decoder architecture proposed in this research makes use of a novel Augmentation and Ranking (A-R model) mechanism. A rich featured image dataset is produced by augmentation, and a ranking system aids in choosing the top k priority terms. The Ranking LSTM assists in identifying the meaningful captions through ranks. The Image Captioning system performs more effectively due to this blending technique. Greedy and beam search are used to investigate the proposed A-R model under maximum and average pooling. newline |
Pagination: | xii,113p. |
URI: | http://hdl.handle.net/10603/597025 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 25.55 kB | Adobe PDF | View/Open |
02_prelim_pages.pdf | 1.84 MB | Adobe PDF | View/Open | |
03_contents.pdf | 95.78 kB | Adobe PDF | View/Open | |
04_abstracts.pdf | 14.46 kB | Adobe PDF | View/Open | |
05_chapter1.pdf | 242.61 kB | Adobe PDF | View/Open | |
06_chapter2.pdf | 203.44 kB | Adobe PDF | View/Open | |
07_chapter3.pdf | 854.15 kB | Adobe PDF | View/Open | |
08_chapter4.pdf | 486.88 kB | Adobe PDF | View/Open | |
09_chapter5.pdf | 327.62 kB | Adobe PDF | View/Open | |
10_chapter6.pdf | 1.58 MB | Adobe PDF | View/Open | |
11_chapter7.pdf | 21.93 kB | Adobe PDF | View/Open | |
12_annexures.pdf | 127.94 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 59.66 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial 4.0 International (CC BY-NC 4.0).
Altmetric Badge: