DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving

Jialv Zou; Shaoyu Chen; Bencheng Liao; Zhiyu Zheng; Yuehao Song; Lefei Zhang; Qian Zhang; Wenyu Liu; Xinggang Wang

arXiv:2512.07745·cs.CV·December 9, 2025

DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving

Jialv Zou, Shaoyu Chen, Bencheng Liao, Zhiyu Zheng, Yuehao Song, Lefei Zhang, Qian Zhang, Wenyu Liu, Xinggang Wang

PDF

Open Access

TL;DR

DiffusionDriveV2 introduces reinforcement learning constraints to truncated diffusion models, significantly improving diversity and quality in end-to-end autonomous driving trajectories while maintaining multimodality.

Contribution

It proposes a novel reinforcement learning framework with intra- and inter-anchor GRPO to enhance diffusion-based autonomous driving models.

Findings

01

Achieves 91.2 PDMS on NAVSIM v1 dataset

02

Achieves 85.5 EPDMS on NAVSIM v2 dataset

03

Sets new state-of-the-art performance in closed-loop evaluation

Abstract

Generative diffusion models for end-to-end autonomous driving often suffer from mode collapse, tending to generate conservative and homogeneous behaviors. While DiffusionDrive employs predefined anchors representing different driving intentions to partition the action space and generate diverse trajectories, its reliance on imitation learning lacks sufficient constraints, resulting in a dilemma between diversity and consistent high quality. In this work, we propose DiffusionDriveV2, which leverages reinforcement learning to both constrain low-quality modes and explore for superior trajectories. This significantly enhances the overall output quality while preserving the inherent multimodality of its core Gaussian Mixture Model. First, we use scale-adaptive multiplicative noise, ideal for trajectory planning, to promote broad exploration. Second, we employ intra-anchor GRPO to manage…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Generative Adversarial Networks and Image Synthesis