Multimodal Motion Prediction with Stacked Transformers
Yicheng Liu, Jinghuai Zhang, Liangji Fang, Qinhong Jiang, Bolei Zhou

TL;DR
This paper introduces mmTransformer, a novel stacked transformer framework for multimodal vehicle motion prediction, which models multiple future trajectories with fixed proposals, improving diversity and accuracy over previous methods.
Contribution
The paper presents a new transformer-based architecture with a region-based training strategy for multimodal motion prediction, enhancing diversity and accuracy.
Findings
Achieves state-of-the-art performance on Argoverse dataset.
Substantially improves diversity of predicted trajectories.
Enhances prediction accuracy compared to prior methods.
Abstract
Predicting multiple plausible future trajectories of the nearby vehicles is crucial for the safety of autonomous driving. Recent motion prediction approaches attempt to achieve such multimodal motion prediction by implicitly regularizing the feature or explicitly generating multiple candidate proposals. However, it remains challenging since the latent features may concentrate on the most frequent mode of the data while the proposal-based methods depend largely on the prior knowledge to generate and select the proposals. In this work, we propose a novel transformer framework for multimodal motion prediction, termed as mmTransformer. A novel network architecture based on stacked transformers is designed to model the multimodality at feature level with a set of fixed independent proposals. A region-based training strategy is then developed to induce the multimodality of the generated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Video Surveillance and Tracking Methods · Advanced Neural Network Applications
