HAD: Combining Hierarchical Diffusion with Metric-Decoupled RL for End-to-End Driving

Wenhao Yao; Xinglong Sun; Zhenxin Li; Shiyi Lan; Zi Wang; Jose M. Alvarez; Zuxuan Wu

arXiv:2604.03581 · cs.RO · April 7, 2026

TL;DR

HAD is a hierarchical diffusion planning framework with structured trajectory expansion and metric-decoupled RL for end-to-end autonomous driving, reporting gains of +2.3 EPDMS on NAVSIM and +4.9 Route Completion on HUGSIM.

Contribution

The paper contributes three components: a hierarchical diffusion policy that plans coarse-to-fine, structure-preserved trajectory expansion, and metric-decoupled policy optimization.

Findings

01. Achieved +2.3 EPDMS on NAVSIM

02. Achieved +4.9 Route Completion on HUGSIM

03. Outperformed prior methods by a large margin

Abstract

End-to-end planning has emerged as a dominant paradigm for autonomous driving, where recent models often adopt a scoring-selection framework to choose trajectories from a large set of candidates, with diffusion-based decoding showing strong promise. However, directly selecting from the entire candidate space remains difficult to optimize, and Gaussian perturbations used in diffusion often introduce unrealistic trajectories that complicate the denoising process. In addition, for training these models, reinforcement learning (RL) has shown promise, but existing end-to-end RL approaches typically rely on a single coupled reward without structured signals, limiting optimization effectiveness. To address these challenges, we propose HAD, an end-to-end planning framework with a Hierarchical Diffusion Policy that decomposes planning into a coarse-to-fine process. To improve trajectory…
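To make the coarse-to-fine idea concrete, here is a minimal toy sketch of two-stage candidate selection: first pick a coarse driving mode, then pick the best trajectory within that mode, rather than scoring one flat candidate set. This is not HAD's diffusion policy. The function `coarse_to_fine_select`, the mode names, the waypoint data, and the scorer `toy_score` are all hypothetical and chosen only for illustration.

```python
from statistics import mean


def coarse_to_fine_select(groups, score):
    """Pick a trajectory in two stages instead of scoring one flat set.

    groups: dict mapping a coarse mode name to a list of candidate trajectories.
    score:  callable returning a float for one trajectory (higher is better).
    """
    # Coarse stage: rank each mode by the mean score of its members.
    best_mode = max(groups, key=lambda m: mean(score(t) for t in groups[m]))
    # Fine stage: choose the best trajectory inside the winning mode only.
    best_traj = max(groups[best_mode], key=score)
    return best_mode, best_traj


# Hypothetical toy data: each trajectory is a list of (x, y) waypoints.
groups = {
    "keep_lane": [[(0.0, 0.0), (5.0, 0.1)], [(0.0, 0.0), (5.0, 0.4)]],
    "turn_left": [[(0.0, 0.0), (3.0, 2.0)], [(0.0, 0.0), (3.0, 2.5)]],
}


def toy_score(traj):
    # Stand-in scorer (an assumption): reward forward progress at the
    # final waypoint and penalize its lateral offset.
    x, y = traj[-1]
    return x - abs(y)


mode, traj = coarse_to_fine_select(groups, toy_score)
print(mode, traj)
```

The fine stage only compares candidates within the selected mode, which is the sense in which a hierarchical decomposition shrinks the space each decision has to search. A learned planner would replace both the grouping and the scorer.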
