SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction
Wei Wu, Xiaoxin Feng, Ziyan Gao, Yuheng Kan

TL;DR
SMART introduces a scalable, GPT-style transformer model for autonomous driving motion generation that leverages sequence token prediction, achieving state-of-the-art results and zero-shot generalization with high inference speed.
Contribution
The paper presents a novel sequence token-based transformer approach for motion generation, demonstrating scalability, zero-shot generalization, and superior performance in autonomous driving tasks.
Findings
Achieved state-of-the-art performance on the Sim Agents challenge.
Demonstrated zero-shot generalization from NuPlan to WOMD datasets.
Validated scalability with over 1 billion motion tokens collected.
Abstract
Data-driven autonomous driving motion generation tasks are frequently impacted by the limitations of dataset size and the domain gap between datasets, which precludes their extensive application in real-world scenarios. To address this issue, we introduce SMART, a novel autonomous driving motion generation paradigm that models vectorized map and agent trajectory data into discrete sequence tokens. These tokens are then processed through a decoder-only transformer architecture to train for the next token prediction task across spatial-temporal series. This GPT-style method allows the model to learn the motion distribution in real driving scenarios. SMART achieves state-of-the-art performance across most of the metrics on the generative Sim Agents challenge, ranking 1st on the leaderboards of Waymo Open Motion Dataset (WOMD), demonstrating remarkable inference speed. Moreover, SMART…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications · Model-Driven Software Engineering Techniques · Software Testing and Debugging Techniques
