Spatial Hierarchy and Temporal Attention Guided Cross Masking for   Self-supervised Skeleton-based Action Recognition

Xinpeng Yin; Wenming Cao

arXiv:2409.17951·cs.CV·September 27, 2024

Spatial Hierarchy and Temporal Attention Guided Cross Masking for Self-supervised Skeleton-based Action Recognition

Xinpeng Yin, Wenming Cao

PDF

Open Access 1 Repo

TL;DR

This paper proposes HA-CM, a novel self-supervised skeleton-based action recognition framework that employs hierarchical and attention-guided cross-masking in both spatial and temporal domains to improve feature learning and robustness.

Contribution

The paper introduces a hierarchical and attention-guided cross-masking framework (HA-CM) that applies novel masking strategies in spatial and temporal aspects for skeleton sequences.

Findings

01

Effective on NTU-60, NTU-120, and PKU-MMD datasets.

02

Improves model robustness and feature learning.

03

Outperforms existing self-supervised methods.

Abstract

In self-supervised skeleton-based action recognition, the mask reconstruction paradigm is gaining interest in enhancing model refinement and robustness through effective masking. However, previous works primarily relied on a single masking criterion, resulting in the model overfitting specific features and overlooking other effective information. In this paper, we introduce a hierarchy and attention guided cross-masking framework (HA-CM) that applies masking to skeleton sequences from both spatial and temporal perspectives. Specifically, in spatial graphs, we utilize hyperbolic space to maintain joint distinctions and effectively preserve the hierarchical structure of high-dimensional skeletons, employing joint hierarchy as the masking criterion. In temporal flows, we substitute traditional distance metrics with the global attention of joints for masking, addressing the convergence of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YinxPeng/HA-CM-main
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Gait Recognition and Analysis

MethodsSoftmax · Attention Is All You Need