Contrast-reconstruction Representation Learning for Self-supervised   Skeleton-based Action Recognition

Peng Wang; Jun Wen; Chenyang Si; Yuntao Qian; Liang Wang

arXiv:2111.11051·cs.CV·February 14, 2023

Contrast-reconstruction Representation Learning for Self-supervised Skeleton-based Action Recognition

Peng Wang, Jun Wen, Chenyang Si, Yuntao Qian, Liang Wang

PDF

TL;DR

This paper introduces CRRL, a novel unsupervised learning framework for skeleton-based action recognition that captures both postures and motion dynamics, significantly outperforming existing methods on multiple benchmarks.

Contribution

The paper proposes a contrast-reconstruction network with a knowledge-distillation fusion strategy for unsupervised skeleton-based action recognition, effectively learning motion and posture representations.

Findings

01

CRRL outperforms state-of-the-art methods on multiple benchmarks.

02

The contrastive motion learner enhances motion representation learning.

03

Knowledge distillation improves the fusion of posture and motion features.

Abstract

Skeleton-based action recognition is widely used in varied areas, e.g., surveillance and human-machine interaction. Existing models are mainly learned in a supervised manner, thus heavily depending on large-scale labeled data which could be infeasible when labels are prohibitively expensive. In this paper, we propose a novel Contrast-Reconstruction Representation Learning network (CRRL) that simultaneously captures postures and motion dynamics for unsupervised skeleton-based action recognition. It mainly consists of three parts: Sequence Reconstructor, Contrastive Motion Learner, and Information Fuser. The Sequence Reconstructor learns representation from skeleton coordinate sequence via reconstruction, thus the learned representation tends to focus on trivial postural coordinates and be hesitant in motion learning. To enhance the learning of motions, the Contrastive Motion Learner…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsContrastive Learning