ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition
Masahiro Mitsuhara, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu, Fujiyoshi

TL;DR
This paper introduces ST-ABN, a novel method for visual explanation in video recognition that considers both spatial and temporal information, enhancing interpretability and recognition accuracy.
Contribution
The paper proposes ST-ABN, a spatio-temporal attention mechanism that improves visual explanation and recognition performance in video analysis.
Findings
Enables visual explanation considering spatial and temporal info
Improves recognition accuracy on Something-Something datasets
Demonstrates better interpretability of video decision processes
Abstract
It is difficult for people to interpret the decision-making in the inference process of deep neural networks. Visual explanation is one method for interpreting the decision-making of deep learning. It analyzes the decision-making of 2D CNNs by visualizing an attention map that highlights discriminative regions. Visual explanation for interpreting the decision-making process in video recognition is more difficult because it is necessary to consider not only spatial but also temporal information, which is different from the case of still images. In this paper, we propose a visual explanation method called spatio-temporal attention branch network (ST-ABN) for video recognition. It enables visual explanation for both spatial and temporal information. ST-ABN acquires the importance of spatial and temporal information during network inference and applies it to recognition processing to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Advanced Neural Network Applications
