TL;DR
This paper introduces a self-supervised method for gait-based person re-identification using 3D skeleton data, leveraging locality-aware attention and contrastive learning to improve gait representations without labeled data.
Contribution
It proposes a novel self-supervised gait encoding framework with locality-aware mechanisms, outperforming existing skeleton-based methods and rivaling multi-modal approaches.
Findings
Outperforms skeleton-based methods by 15-40% Rank-1 accuracy.
Achieves superior performance compared to multi-modal methods with RGB or depth.
Introduces a new self-supervised learning scheme with locality-aware attention and contrastive learning.
Abstract
Person re-identification (Re-ID) via gait features within 3D skeleton sequences is a newly-emerging topic with several advantages. Existing solutions either rely on hand-crafted descriptors or supervised gait representation learning. This paper proposes a self-supervised gait encoding approach that can leverage unlabeled skeleton data to learn gait representations for person Re-ID. Specifically, we first create self-supervision by learning to reconstruct unlabeled skeleton sequences reversely, which involves richer high-level semantics to obtain better gait representations. Other pretext tasks are also explored to further improve self-supervised learning. Second, inspired by the fact that motion's continuity endows adjacent skeletons in one skeleton sequence and temporally consecutive skeleton sequences with higher correlations (referred as locality in 3D skeleton data), we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning
