Interpretable Action Recognition on Hard to Classify Actions
Anastasia Anichenko, Frank Guerin, Andrew Gilbert

TL;DR
This paper enhances an interpretable video activity recognition model by integrating 3D depth information, significantly improving recognition accuracy for challenging actions in the Something-Something-v2 dataset.
Contribution
The paper introduces a novel 3D-aware extension to an interpretable action recognition model, incorporating object shape and depth relations to better distinguish similar actions.
Findings
Depth relations significantly improved model performance.
Container shape detection did not enhance accuracy.
3D information is crucial for recognizing complex actions.
Abstract
We investigate a human-like interpretable model of video understanding. Humans recognise complex activities in video by recognising critical spatio-temporal relations among explicitly recognised objects and parts, for example, an object entering the aperture of a container. To mimic this we build on a model which uses positions of objects and hands, and their motions, to recognise the activity taking place. To improve this model we focussed on three of the most confused classes (for this model) and identified that the lack of 3D information was the major problem. To address this we extended our basic model by adding 3D awareness in two ways: (1) A state-of-the-art object detection model was fine-tuned to determine the difference between "Container" and "NotContainer" in order to integrate object shape information into the existing object features. (2) A state-of-the-art depth estimation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Human Pose and Action Recognition · Anomaly Detection Techniques and Applications
