TL;DR
RetroMotion introduces a retrocausal transformer-based approach for multi-agent motion forecasting, decomposing joint distributions into marginals and pairwise interactions, enabling instruction-following and strong performance on benchmark datasets.
Contribution
It proposes a novel retrocausal modeling framework that improves multi-agent motion prediction and incorporates instruction-following capabilities.
Findings
Achieves strong results in Waymo Interaction Prediction Challenge.
Generalizes well to Argoverse 2 and V2X-Seq datasets.
Provides an interface for issuing and following instructions in motion forecasting.
Abstract
Motion forecasts of road users (i.e., agents) vary in complexity depending on the number of agents, scene constraints, and interactions. In particular, the output space of joint trajectory distributions grows exponentially with the number of agents. Therefore, we decompose multi-agent motion forecasts into (1) marginal distributions for all modeled agents and (2) joint distributions for interacting agents. Using a transformer model, we generate joint distributions by re-encoding marginal distributions followed by pairwise modeling. This incorporates a retrocausal flow of information from later points in marginal trajectories to earlier points in joint trajectories. For each time step, we model the positional uncertainty using compressed exponential power distributions. Notably, our method achieves strong results in the Waymo Interaction Prediction Challenge and generalizes well to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
