Pyramid Self-attention Polymerization Learning for Semi-supervised Skeleton-based Action Recognition
Binqian Xu, Xiangbo Shu

TL;DR
This paper introduces PSP Learning, a novel framework that leverages multi-level attention and contrastive learning to improve semi-supervised skeleton-based action recognition by capturing coarse-to-fine semantic features.
Contribution
It proposes a Pyramid Polymerizing Attention mechanism and a Coarse-to-fine Contrastive Loss to jointly learn multi-granularity action representations, enhancing semi-supervised learning performance.
Findings
Achieves competitive results on NTU RGB+D and UCLA datasets.
Effectively captures semantic information at body, part, and joint levels.
Demonstrates the effectiveness of coarse-to-fine contrastive learning.
Abstract
Most semi-supervised skeleton-based action recognition approaches aim to learn the skeleton action representations only at the joint level, but neglect the crucial motion characteristics at the coarser-grained body (e.g., limb, trunk) level that provide rich additional semantic information, though the number of labeled data is limited. In this work, we propose a novel Pyramid Self-attention Polymerization Learning (dubbed as PSP Learning) framework to jointly learn body-level, part-level, and joint-level action representations of joint and motion data containing abundant and complementary semantic information via contrastive learning covering coarse-to-fine granularity. Specifically, to complement semantic information from coarse to fine granularity in skeleton actions, we design a new Pyramid Polymerizing Attention (PPA) mechanism that firstly calculates the body-level attention map,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Medical Imaging and Analysis · Stroke Rehabilitation and Recovery
MethodsContrastive Learning
