Severe Domain Shift in Skeleton-Based Action Recognition:A Study of Uncertainty Failure in Real-World Gym Environments
Aaditya Khanal, Junxiu Zhou

TL;DR
This paper investigates the severe domain shift challenges in skeleton-based action recognition when transitioning from controlled to real-world gym environments, highlighting the limitations of current uncertainty detection methods and proposing a gating mechanism for safer deployment.
Contribution
The study introduces a novel Gym2D dataset, systematically analyzes the failure of uncertainty methods under domain shift, and proposes a lightweight gating mechanism to improve safety in real-world applications.
Findings
Skeleton Transformer accuracy drops from 63.2% to below 2% under domain shift.
High OOD detection AUROC does not correlate with safe decision-making.
Gating mechanism significantly reduces confident wrong predictions.
Abstract
The practical deployment gap -- transitioning from controlled multi-view 3D skeleton capture to unconstrained monocular 2D pose estimation -- introduces a compound domain shift whose safety implications remain critically underexplored. We present a systematic study of this severe domain shift using a novel Gym2D dataset (style/viewpoint shift) and the UCF101 dataset (semantic shift). Our Skeleton Transformer achieves 63.2% cross-subject accuracy on NTU-120 but drops to 1.6% under zero-shot transfer to the Gym domain and 1.16% on UCF101. Critically, we demonstrate that high Out-Of-Distribution (OOD) detection AUROC does not guarantee safe selective classification. Standard uncertainty methods fail to detect this performance drop: the model remains confidently incorrect with 99.6% risk even at 50% coverage across both OOD datasets. While energy-based scoring (AUROC >=…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Context-Aware Activity Recognition Systems · Robot Manipulation and Learning
