Motif Guided Graph Transformer with Combinatorial Skeleton Prototype Learning for Skeleton-Based Person Re-Identification
Haocong Rao, Chunyan Miao

TL;DR
This paper introduces MoCos, a novel skeleton-based person re-ID method that leverages motif-guided graph transformers and combinatorial prototype learning to focus on key body structures and gait patterns, improving discriminative representation learning.
Contribution
The paper proposes a new motif-guided graph transformer and combinatorial skeleton prototype learning approach for enhanced skeleton representation in person re-ID.
Findings
MoCos outperforms existing state-of-the-art models.
Effective in RGB-estimated skeletons and unsupervised scenarios.
Enhances focus on gait and body structure in skeleton-based re-ID.
Abstract
Person re-identification (re-ID) via 3D skeleton data is a challenging task with significant value in many scenarios. Existing skeleton-based methods typically assume virtual motion relations between all joints, and adopt average joint or sequence representations for learning. However, they rarely explore key body structure and motion such as gait to focus on more important body joints or limbs, while lacking the ability to fully mine valuable spatial-temporal sub-patterns of skeletons to enhance model learning. This paper presents a generic Motif guided graph transformer with Combinatorial skeleton prototype learning (MoCos) that exploits structure-specific and gait-related body relations as well as combinatorial features of skeleton graphs to learn effective skeleton representations for person re-ID. In particular, motivated by the locality within joints' structure and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGait Recognition and Analysis · Human Pose and Action Recognition · Video Surveillance and Tracking Methods
MethodsAttention Is All You Need · Adam · Dropout · Position-Wise Feed-Forward Layer · Softmax · Dense Connections · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Label Smoothing
