Survey on the attention based RNN model and its applications in computer vision
Feng Wang, David M.J. Tax

TL;DR
This survey reviews attention-based RNN models, detailing their mechanisms and applications in computer vision, highlighting their advantages over traditional RNNs through experimental results and discussing future research directions.
Contribution
It provides a comprehensive overview of attention-based RNN models and their applications in computer vision, emphasizing their improved ability to model implicit relations.
Findings
Attention mechanisms enhance RNN performance in sequence tasks.
Attention-based RNNs outperform traditional models in computer vision applications.
The survey identifies promising future research directions.
Abstract
The recurrent neural networks (RNN) can be used to solve the sequence to sequence problem, where both the input and the output have sequential structures. Usually there are some implicit relations between the structures. However, it is hard for the common RNN model to fully explore the relations between the sequences. In this survey, we introduce some attention based RNN models which can focus on different parts of the input for each output item, in order to explore and take advantage of the implicit relations between the input and the output items. The different attention mechanisms are described in detail. We then introduce some applications in computer vision which apply the attention based RNN models. The superiority of the attention based RNN model is shown by the experimental results. At last some future research directions are given.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Multimodal Machine Learning Applications
