Explaining Motion Relevance for Activity Recognition in Video Deep Learning Models
Liam Hiley, Alun Preece, Yulia Hicks, Supriyo Chakraborty and, Prudhvi Gurram, Richard Tomsett

TL;DR
This paper introduces a method to adapt 2D explanation techniques for 3D activity recognition models, enabling clearer understanding of motion's role in model decisions, which was previously obscured.
Contribution
The authors propose a selective relevance method that enhances 2D explanation techniques to specifically highlight motion information in 3D activity recognition models.
Findings
Improves explanation selectivity for motion in 3D models
Reveals and quantifies the model's spatial bias
Simplifies explanations for human understanding
Abstract
A small subset of explainability techniques developed initially for image recognition models has recently been applied for interpretability of 3D Convolutional Neural Network models in activity recognition tasks. Much like the models themselves, the techniques require little or no modification to be compatible with 3D inputs. However, these explanation techniques regard spatial and temporal information jointly. Therefore, using such explanation techniques, a user cannot explicitly distinguish the role of motion in a 3D model's decision. In fact, it has been shown that these models do not appropriately factor motion information into their decision. We propose a selective relevance method for adapting the 2D explanation techniques to provide motion-specific explanations, better aligning them with the human understanding of motion as conceptually separate from static spatial features. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Human Pose and Action Recognition · Multimodal Machine Learning Applications
MethodsInterpretability
