Joint-Relation Transformer for Multi-Person Motion Prediction
Qingyao Xu, Weibo Mao, Jingze Gong, Chenxin Xu, Siheng Chen, Weidi, Xie, Ya Zhang, Yanfeng Wang

TL;DR
This paper introduces the Joint-Relation Transformer, which incorporates explicit relation information like distances and constraints to enhance multi-person motion prediction accuracy, outperforming existing transformer-based methods.
Contribution
The paper proposes a novel joint-relation fusion layer with relation-aware attention and supervision via future distance forecasting, improving interaction modeling in motion prediction.
Findings
Achieves 13.4% improvement on 900ms VIM metric.
Improves 3s MPJPE by 17.8% on CMU-Mpcap dataset.
Enhances interaction modeling with relation-aware attention.
Abstract
Multi-person motion prediction is a challenging problem due to the dependency of motion on both individual past movements and interactions with other people. Transformer-based methods have shown promising results on this task, but they miss the explicit relation representation between joints, such as skeleton structure and pairwise distance, which is crucial for accurate interaction modeling. In this paper, we propose the Joint-Relation Transformer, which utilizes relation information to enhance interaction modeling and improve future motion prediction. Our relation information contains the relative distance and the intra-/inter-person physical constraints. To fuse relation and joint information, we design a novel joint-relation fusion layer with relation-aware attention to update both features. Additionally, we supervise the relation information by forecasting future distance.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Joint-Relation Transformer for Multi-Person Motion Prediction· youtube
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Gait Recognition and Analysis
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Layer Normalization · Label Smoothing · Adam · Residual Connection · Dense Connections · Dropout
