OpenREAD: Reinforced Open-Ended Reasoning for End-to-End Autonomous Driving with LLM-as-Critic

Songyan Zhang; Wenhui Huang; Zhan Chen; Chua Jiahao Collister; Qihang Huang; Chen Lv

arXiv:2512.01830·cs.CV·December 3, 2025

OpenREAD: Reinforced Open-Ended Reasoning for End-to-End Autonomous Driving with LLM-as-Critic

Songyan Zhang, Wenhui Huang, Zhan Chen, Chua Jiahao Collister, Qihang Huang, Chen Lv

PDF

Open Access

TL;DR

OpenREAD introduces an end-to-end reinforcement fine-tuning framework using large language models as critics, significantly improving reasoning and planning in autonomous driving by leveraging open-ended reasoning and large-scale annotations.

Contribution

The paper presents a novel open-ended reasoning framework for autonomous driving that integrates large language models as critics in reinforcement fine-tuning, enhancing reasoning and planning capabilities.

Findings

01

Achieves state-of-the-art performance on reasoning benchmarks.

02

Improves both upstream and downstream autonomous driving tasks.

03

Effectively utilizes large-scale Chain-of-Thought annotations.

Abstract

Recently, two-stage fine-tuning strategies, e.g., acquiring essential driving knowledge through supervised fine-tuning (SFT) and further enhancing decision-making and planning via reinforcement fine-tuning (RFT), have shown strong potential in advancing the knowledge-driven autonomous driving (AD) paradigm. However, the learning nature of SFT still limits the generalization of reasoning, thereby constraining the full potential of driving performance. Meanwhile, current RFT approaches are primarily applied to downstream tasks, since scene understanding is an open-ended problem where corresponding rewards are difficult to quantify. To address these limitations, we propose OpenREAD, an OPEN-ended REasoning reinforced vision-language model (VLM)-based autonomous driving (AD) framework that enables end-to-end RFT across the full spectrum from high-level reasoning to low-level trajectory…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Multimodal Machine Learning Applications