Spatial Hierarchy and Temporal Attention Guided Cross Masking for Self-supervised Skeleton-based Action Recognition
Xinpeng Yin, Wenming Cao

TL;DR
This paper proposes HA-CM, a novel self-supervised skeleton-based action recognition framework that employs hierarchical and attention-guided cross-masking in both spatial and temporal domains to improve feature learning and robustness.
Contribution
The paper introduces a hierarchical and attention-guided cross-masking framework (HA-CM) that applies novel masking strategies in spatial and temporal aspects for skeleton sequences.
Findings
Effective on NTU-60, NTU-120, and PKU-MMD datasets.
Improves model robustness and feature learning.
Outperforms existing self-supervised methods.
Abstract
In self-supervised skeleton-based action recognition, the mask reconstruction paradigm is gaining interest in enhancing model refinement and robustness through effective masking. However, previous works primarily relied on a single masking criterion, resulting in the model overfitting specific features and overlooking other effective information. In this paper, we introduce a hierarchy and attention guided cross-masking framework (HA-CM) that applies masking to skeleton sequences from both spatial and temporal perspectives. Specifically, in spatial graphs, we utilize hyperbolic space to maintain joint distinctions and effectively preserve the hierarchical structure of high-dimensional skeletons, employing joint hierarchy as the masking criterion. In temporal flows, we substitute traditional distance metrics with the global attention of joints for masking, addressing the convergence of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Gait Recognition and Analysis
MethodsSoftmax · Attention Is All You Need
