Learning from Mistakes: Post-Training for Driving VLA with Takeover Data
Yinfeng Gao, Deqing Liu, Qichao Zhang, Yupeng Zheng, Haochen Tian, Guang Li, Hangjun Ye, Long Chen, Da-Wei Ding, Dongbin Zhao

TL;DR
This paper introduces TakeVLA, a novel post-training framework for vision-language-action in autonomous driving that improves safety margins and performance by proactive mistake learning and active scenario exploration.
Contribution
The paper presents two innovations: pre-takeover language supervision for proactive mistake learning and Scenario Dreaming for active exploration in reconstructed scenarios.
Findings
Achieves state-of-the-art performance on Bench2Drive benchmark.
Surpasses baseline by 4.93 in driving score.
Increases safety margin with 11.76% higher average TTC.
Abstract
Current Vision-Language-Action (VLA) paradigms in end-to-end autonomous driving rely on offline training from static datasets, leaving them vulnerable to distribution shift. Recent post-training methods use takeover data to mitigate this by augmenting the dataset with high-quality expert takeover samples, yet they suffer from two key limitations: supervision restricted to the period after the takeover moments leads to policies with limited safety margins, and passive preference optimization lacks active exploration for optimal performance. In this paper, we propose TakeVLA, a novel VLA post-training framework that overcomes these shortcomings through two complementary innovations. First, we introduce pre-takeover language supervision, which allows the VLA to learn from mistakes proactively. By explicitly teaching the model about what to do in error-prone situations, we cultivate a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Autonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning
