CorrectionPlanner: Self-Correction Planner with Reinforcement Learning in Autonomous Driving

Yihong Guo; Dongqiangzi Ye; Sijia Chen; Anqi Liu; Xianming Liu

arXiv:2603.15771·cs.RO·March 18, 2026

CorrectionPlanner: Self-Correction Planner with Reinforcement Learning in Autonomous Driving

Yihong Guo, Dongqiangzi Ye, Sijia Chen, Anqi Liu, Xianming Liu

PDF

Open Access

TL;DR

CorrectionPlanner introduces a self-correcting autoregressive planning approach for autonomous driving, utilizing reinforcement learning and a collision critic to reduce collisions and improve planning performance.

Contribution

It presents a novel self-correction mechanism in autonomous driving planning using motion-token generation and reinforcement learning, enhancing safety and effectiveness.

Findings

01

Reduces collision rate by over 20% on Waymax

02

Achieves state-of-the-art planning scores on nuPlan

03

Demonstrates effective self-correction in autonomous driving scenarios

Abstract

Autonomous driving requires safe planning, but most learning-based planners lack explicit self-correction ability: once an unsafe action is proposed, there is no mechanism to correct it. Thus, we propose CorrectionPlanner, an autoregressive planner with self-correction that models planning as motion-token generation within a propose, evaluate, and correct loop. At each planning step, the policy proposes an action, namely a motion token, and a learned collision critic predicts whether it will induce a collision within a short horizon. If the critic predicts a collision, we retain the sequence of historical unsafe motion tokens as a self-correction trace, generate the next motion token conditioned on it, and repeat this process until a safe motion token is proposed or the safety criterion is met. This self-correction trace, consisting of all unsafe motion tokens, represents the planner's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms