BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch Prediction
Zikang Zhou, Haibo Hu, Xinhong Chen, Jianping Wang, Nan Guan, Kui Wu,, Yung-Hui Li, Yu-Kai Huang, Chun Jason Xue

TL;DR
BehaviorGPT is a fully autoregressive Transformer-based simulator that models traffic agent behaviors without separating history and future, achieving high realism and efficiency in autonomous driving simulations.
Contribution
It introduces a homogeneous, autoregressive Transformer and the Next-Patch Prediction paradigm to improve traffic agent simulation accuracy and data efficiency.
Findings
Won first place in Waymo Open Sim Agents Challenge 2024
Achieved a realism score of 0.7473
Attained a minADE score of 1.4147
Abstract
Simulating realistic behaviors of traffic agents is pivotal for efficiently validating the safety of autonomous driving systems. Existing data-driven simulators primarily use an encoder-decoder architecture to encode the historical trajectories before decoding the future. However, the heterogeneity between encoders and decoders complicates the models, and the manual separation of historical and future trajectories leads to low data utilization. Given these limitations, we propose BehaviorGPT, a homogeneous and fully autoregressive Transformer designed to simulate the sequential behavior of multiple agents. Crucially, our approach discards the traditional separation between "history" and "future" by modeling each time step as the "current" one for motion generation, leading to a simpler, more parameter- and data-efficient agent simulator. We further introduce the Next-Patch Prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Transportation and Mobility Innovations · Traffic Prediction and Management Techniques
MethodsAttention Is All You Need · Linear Layer · Dense Connections · Label Smoothing · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Multi-Head Attention · Residual Connection
