Attention on Attention for Image Captioning
Lun Huang, Wenmin Wang, Jie Chen, Xiao-Yong Wei

TL;DR
This paper introduces the Attention on Attention (AoA) module, which lets image captioning models assess how relevant an attention result is to its query, and reports state-of-the-art performance on the MS COCO dataset.
Contribution
The paper proposes the AoA module, which extends conventional attention mechanisms to determine the relevance between the attention result and the query, and applies it to both the encoder and the decoder of an image captioning model.
Findings
Achieved a CIDEr-D score of 129.8 on the MS COCO Karpathy offline test split.
Outperformed previously published image captioning methods on MS COCO at the time of publication.
Demonstrated the effectiveness of AoA in both encoder and decoder.
Abstract
Attention mechanisms are widely used in current encoder/decoder frameworks of image captioning, where a weighted average on encoded vectors is generated at each time step to guide the caption decoding process. However, the decoder has little idea of whether or how well the attended vector and the given attention query are related, which could make the decoder give misled results. In this paper, we propose an Attention on Attention (AoA) module, which extends the conventional attention mechanisms to determine the relevance between attention results and queries. AoA first generates an information vector and an attention gate using the attention result and the current context, then adds another attention by applying element-wise multiplication to them and finally obtains the attended information, the expected useful knowledge. We apply AoA to both the encoder and the decoder of our image…
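The gating step described in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation: it assumes single-head scaled dot-product attention, a single query vector, and linear maps over the concatenated query and attention result. The names `attention_on_attention`, `init_params`, `W_info`, `W_gate`, `b_info`, and `b_gate` are hypothetical, and the weights are random rather than learned.

```python
import numpy as np


def softmax(x, axis=-1):
    # Subtract the max for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def dot_product_attention(q, K, V):
    # q: (d,), K: (n, d), V: (n, d). Returns the weighted average of V, shape (d,).
    scores = K @ q / np.sqrt(q.shape[-1])
    return softmax(scores) @ V


def attention_on_attention(q, K, V, params):
    # Step 1: conventional attention result for the query.
    v_hat = dot_product_attention(q, K, V)
    # Step 2: build the information vector and the attention gate
    # from the attention result and the query (the "current context").
    x = np.concatenate([q, v_hat])
    info = params["W_info"] @ x + params["b_info"]
    gate = sigmoid(params["W_gate"] @ x + params["b_gate"])
    # Step 3: the second attention is an element-wise product of the two.
    return gate * info


def init_params(d, rng):
    # Hypothetical random initialization, for illustration only.
    return {
        "W_info": rng.normal(scale=0.1, size=(d, 2 * d)),
        "b_info": np.zeros(d),
        "W_gate": rng.normal(scale=0.1, size=(d, 2 * d)),
        "b_gate": np.zeros(d),
    }


rng = np.random.default_rng(0)
d, n = 8, 5  # feature size and number of encoded vectors (assumed values)
params = init_params(d, rng)
q = rng.normal(size=d)
K = rng.normal(size=(n, d))
V = rng.normal(size=(n, d))
out = attention_on_attention(q, K, V, params)
print(out.shape)  # (8,)
```

Because the gate lies in (0, 1) per dimension, a gate near zero suppresses an attention result that is unrelated to the query, while a gate near one passes the information vector through almost unchanged.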
Taxonomy
Topics: Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
