M2A: Motion Aware Attention for Accurate Video Action Recognition
Brennan Gebotys, Alexander Wong, David A. Clausi

TL;DR
This paper introduces Motion Aware Attention (M2A), a novel attention mechanism that explicitly incorporates motion information to improve video action recognition accuracy, outperforming existing methods with minimal added complexity.
Contribution
The paper proposes M2A, a simple, motion-aware attention mechanism that can be integrated into various neural networks to enhance video action recognition performance.
Findings
M2A improves top-1 accuracy by 15-26% across different architectures.
M2A outperforms other motion and attention mechanisms by up to 60% in specific classes.
Incorporating M2A yields significant gains on the Something-Something V1 benchmark.
Abstract
Advancements in attention mechanisms have led to significant performance improvements in a variety of areas in machine learning due to its ability to enable the dynamic modeling of temporal sequences. A particular area in computer vision that is likely to benefit greatly from the incorporation of attention mechanisms in video action recognition. However, much of the current research's focus on attention mechanisms have been on spatial and temporal attention, which are unable to take advantage of the inherent motion found in videos. Motivated by this, we develop a new attention mechanism called Motion Aware Attention (M2A) that explicitly incorporates motion characteristics. More specifically, M2A extracts motion information between consecutive frames and utilizes attention to focus on the motion patterns found across frames to accurately recognize actions in videos. The proposed M2A…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications
MethodsAttentive Walk-Aggregating Graph Neural Network
