PreTR: Spatio-Temporal Non-Autoregressive Trajectory Prediction Transformer
Lina Achaji, Thierno Barry, Thibault Fouqueray, Julien Moreau,, Francois Aioun, Francois Charpillet

TL;DR
PreTR introduces a non-autoregressive Transformer model for pedestrian trajectory prediction that efficiently captures multi-agent spatio-temporal interactions, outperforming previous models in accuracy and computational efficiency.
Contribution
The paper presents PReTR, a novel non-autoregressive Transformer with spatio-temporal attention for trajectory prediction, reducing computational costs and addressing exposure bias.
Findings
Empirically better results on ETH/UCY datasets
Less computational needs compared to prior models
Effective non-autoregressive trajectory prediction
Abstract
Nowadays, our mobility systems are evolving into the era of intelligent vehicles that aim to improve road safety. Due to their vulnerability, pedestrians are the users who will benefit the most from these developments. However, predicting their trajectory is one of the most challenging concerns. Indeed, accurate prediction requires a good understanding of multi-agent interactions that can be complex. Learning the underlying spatial and temporal patterns caused by these interactions is even more of a competitive and open problem that many researchers are tackling. In this paper, we introduce a model called PRediction Transformer (PReTR) that extracts features from the multi-agent scenes by employing a factorized spatio-temporal attention module. It shows less computational needs than previously studied models with empirically better results. Besides, previous works in motion prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Softmax · Residual Connection · Position-Wise Feed-Forward Layer · Dense Connections · Label Smoothing · Dropout
