Evaluation as Evolution: Transforming Adversarial Diffusion into Closed-Loop Curricula for Autonomous Vehicles

Yicheng Guo; Jiaqi Liu; Chengkai Xu; Peng Hang; Jian Sun

arXiv:2604.07378·cs.RO·April 10, 2026

Evaluation as Evolution: Transforming Adversarial Diffusion into Closed-Loop Curricula for Autonomous Vehicles

Yicheng Guo, Jiaqi Liu, Chengkai Xu, Peng Hang, Jian Sun

PDF

TL;DR

This paper presents a novel closed-loop evaluation framework for autonomous vehicles that adaptively generates adversarial scenarios to improve safety testing and policy robustness.

Contribution

It introduces Evaluation as Evolution ($E^2$), transforming static adversarial testing into an adaptive, evolutionary curriculum using transport-regularized control over a learned SDE prior.

Findings

01

$E^2$ improves collision failure discovery by 9.01% on nuScenes.

02

$E^2$ achieves up to 21.43% improvement on nuPlan.

03

Recycling boundary cases for policy fine-tuning enhances robustness.

Abstract

Autonomous vehicles in interactive traffic environments are often limited by the scarcity of safety-critical tail events in static datasets, which biases learned policies toward average-case behaviors and reduces robustness. Existing evaluation methods attempt to address this through adversarial stress testing, but are predominantly open-loop and post-hoc, making it difficult to incorporate discovered failures back into the training process. We introduce Evaluation as Evolution ( $E^{2}$ ), a closed-loop framework that transforms adversarial generation from a static validation step into an adaptive evolutionary curriculum. Specifically, $E^{2}$ formulates adversarial scenario synthesis as transport-regularized sparse control over a learned reverse-time SDE prior. To make this high-dimensional generation tractable, we utilize topology-driven support selection to identify critical interacting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.