The DAWN of World-Action Interactive Models

Hongbo Lu; Liang Yao; Chenghao He; Haoyu Wang; Xiang Gu; Xianfei Li; Wenlong Liao; Tao He; Pai Peng

arXiv:2605.11550·cs.CV·May 13, 2026

TL;DR

The paper introduces DAWN, a novel latent generative model for autonomous driving that couples world prediction with action denoising, enabling recursive refinement and improved long-horizon planning.

Contribution

It formalizes World-Action Interactive Models (WAIMs) and instantiates them in DAWN, a simple yet effective latent model coupling world prediction with action denoising for autonomous driving.

Findings

1. DAWN achieves strong planning performance on autonomous driving benchmarks.
2. DAWN produces favorable safety-related results.
3. DAWN effectively supports long-horizon trajectory generation.

Abstract

A plausible scene evolution depends on the maneuver being considered, while a good maneuver depends on how the scene may evolve. Existing World Action Models (WAMs) largely miss this reciprocity, treating world prediction and action generation as either isolated parallel branches or rigid predict-then-plan pipelines. We formalize this perspective as World-Action Interactive Models (WAIMs), and instantiate it in autonomous driving with DAWN (Denoising Actions and World iNteractive model), a simple yet strong latent generative baseline. DAWN operates in a compact semantic latent space and couples a World Predictor with a World-Conditioned Action Denoiser: the predicted world hypothesis conditions action denoising, while the denoised action hypothesis is fed back to update the world prediction, so that both are recursively refined…
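The recursive coupling described in the abstract can be illustrated with a toy loop. This is only a minimal sketch, not the authors' implementation: the functions `predict_world`, `denoise_action`, and `interactive_refine`, the linear update rules, the coefficients, and the step count are all hypothetical stand-ins chosen so the example runs with the standard library alone. It shows just the control flow: a world hypothesis conditions an action denoising step, and the updated action is fed back to refresh the world hypothesis.

```python
import random


def predict_world(latent, action):
    """Hypothetical world predictor: a toy linear map from the current
    scene latent and an action hypothesis to a predicted world latent."""
    return [0.9 * z + 0.1 * a for z, a in zip(latent, action)]


def denoise_action(action, world, step_size=0.5):
    """Hypothetical world-conditioned denoiser: moves the action
    hypothesis part of the way toward the current world hypothesis."""
    return [a + step_size * (w - a) for a, w in zip(action, world)]


def interactive_refine(latent, num_steps=8, seed=0):
    """Alternate action denoising and world prediction (assumed schedule)."""
    rng = random.Random(seed)
    # Start from a pure-noise action hypothesis, as a denoiser would.
    action = [rng.gauss(0.0, 1.0) for _ in latent]
    world = predict_world(latent, action)
    gaps = []
    for _ in range(num_steps):
        # The world hypothesis conditions the action denoising step.
        action = denoise_action(action, world)
        # The denoised action is fed back to update the world prediction.
        world = predict_world(latent, action)
        # Track how far the two hypotheses are from agreeing.
        gaps.append(max(abs(a - w) for a, w in zip(action, world)))
    return action, world, gaps


if __name__ == "__main__":
    action, world, gaps = interactive_refine([1.0, -0.5])
    print("final action:", action)
    print("final world: ", world)
    print("gap per step:", gaps)
```

In this toy setting the gap between the action and world hypotheses shrinks at every step because both updates are contractions toward the same fixed point. Nothing here says whether or how the real model converges; the learned networks in DAWN replace both toy functions.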

Code & Models

Repositories

coowai/DAWN (GitHub)
