TL;DR
This paper introduces ITRA, a deep generative model using a differentiable simulator for multi-agent trajectory prediction, achieving state-of-the-art results on the INTERACTION dataset.
Contribution
It presents a novel differentiable simulation framework with CVRNNs for multi-agent trajectory prediction, eliminating the need for ad-hoc diversity losses.
Findings
Achieves state-of-the-art performance on INTERACTION dataset
Both the kinematic bicycle model and birdview feedback are crucial
Produces realistic multi-modal predictions without extra diversity losses
Abstract
We develop a deep generative model built on a fully differentiable simulator for multi-agent trajectory prediction. Agents are modeled with conditional recurrent variational neural networks (CVRNNs), which take as input an ego-centric birdview image representing the current state of the world and output an action, consisting of steering and acceleration, which is used to derive the subsequent agent state using a kinematic bicycle model. The full simulation state is then differentiably rendered for each agent, initiating the next time step. We achieve state-of-the-art results on the INTERACTION dataset, using standard neural architectures and a standard variational training objective, producing realistic multi-modal predictions without any ad-hoc diversity-inducing losses. We conduct ablation studies to examine individual components of the simulator, finding that both the kinematic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
