AdvMT: Adversarial Motion Transformer for Long-term Human Motion Prediction
Sarmad Idrees, Jongeun Choi, Seokman Sohn

TL;DR
AdvMT introduces an adversarial transformer model that improves long-term human motion prediction accuracy by capturing spatial-temporal dependencies and reducing artifacts through adversarial training.
Contribution
The paper presents a novel Adversarial Motion Transformer that combines transformer encoding with adversarial training for more realistic long-term human motion prediction.
Findings
Significantly improves long-term prediction accuracy.
Reduces artifacts in predicted human motions.
Enhances robustness of short-term predictions.
Abstract
To achieve seamless collaboration between robots and humans in a shared environment, accurately predicting future human movements is essential. Human motion prediction has traditionally been approached as a sequence prediction problem, leveraging historical human motion data to estimate future poses. Beginning with vanilla recurrent networks, the research community has investigated a variety of methods for learning human motion dynamics, encompassing graph-based and generative approaches. Despite these efforts, achieving accurate long-term predictions continues to be a significant challenge. In this regard, we present the Adversarial Motion Transformer (AdvMT), a novel model that integrates a transformer-based motion encoder and a temporal continuity discriminator. This combination effectively captures spatial and temporal dependencies simultaneously within frames. With adversarial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Gait Recognition and Analysis
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Adam · Layer Normalization · Residual Connection · Absolute Position Encodings · Dropout · Dense Connections
