Real-Time Generative Policy via Langevin-Guided Flow Matching for Autonomous Driving

Tianze Zhu; Yinuo Wang; Wenjun Zou; Tianyi Zhang; Likun Wang; Letian Tao; Feihong Zhang; Yao Lyu; Shengbo Eben Li

arXiv:2603.02613·cs.LG·March 4, 2026

Real-Time Generative Policy via Langevin-Guided Flow Matching for Autonomous Driving

Tianze Zhu, Yinuo Wang, Wenjun Zou, Tianyi Zhang, Likun Wang, Letian Tao, Feihong Zhang, Yao Lyu, Shengbo Eben Li

PDF

Open Access

TL;DR

This paper introduces DACER-F, a real-time generative policy method for autonomous driving that uses flow matching and Langevin dynamics to generate actions in a single inference step, significantly reducing latency.

Contribution

The paper proposes DACER-F, integrating flow matching with online RL and Langevin dynamics to enable fast, competitive action generation for autonomous driving.

Findings

01

Outperforms baselines in complex driving simulations.

02

Achieves high scores on DeepMind Control Suite.

03

Maintains ultra-low inference latency.

Abstract

Reinforcement learning (RL) is a fundamental methodology in autonomous driving systems, where generative policies exhibit considerable potential by leveraging their ability to model complex distributions to enhance exploration. However, their inherent high inference latency severely impedes their deployment in real-time decision-making and control. To address this issue, we propose diffusion actor-critic with entropy regulator via flow matching (DACER-F) by introducing flow matching into online RL, enabling the generation of competitive actions in a single inference step. By leveraging Langevin dynamics and gradients of the Q-function, DACER-F dynamically optimizes actions from experience replay toward a target distribution that balances high Q-value information with exploratory behavior. The flow policy is then trained to efficiently learn a mapping from a simple prior distribution to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Generative Adversarial Networks and Image Synthesis