GSDrive: Reinforcing Driving Policies by Multi-mode Future Trajectory Probing with 3D Gaussian Splatting Environment

Ziang Guo; Chen Min; Xuefeng Zhang; Yixiao Zhou; Shuo Wang; Sifa Zheng; Dzmitry Tsetserukou; Zufeng Zhang

arXiv:2604.28111 · cs.RO · May 18, 2026

TL;DR

GSDrive is a framework that combines imitation learning (IL) and reinforcement learning (RL) with a 3D Gaussian Splatting (3DGS) environment, improving end-to-end autonomous driving policies through future trajectory probing and reward shaping.

Contribution

The paper presents a multi-mode trajectory probing method that uses a differentiable 3D environment to strengthen policy learning with dense rewards and iterative refinement.

Findings

01. Outperforms other simulation-based RL methods on the nuScenes dataset.

02. Uses a cyclic IL-RL training loop for iterative policy improvement.

03. Demonstrates effective future-aware trajectory evaluation in the 3DGS environment.

Abstract

End-to-end (E2E) autonomous driving aims to directly map sensory observations to driving actions, but its real-world deployment is hindered by evolving data distributions and the high cost of continual annotation. While combining imitation learning (IL) and reinforcement learning (RL) is a common strategy for policy improvement, conventional RL training relies on delayed, event-based rewards, where policies learn only from catastrophic outcomes such as collisions, leading to premature convergence to suboptimal behaviors. To address these limitations, we propose GSDrive, a framework that uses a differentiable 3D Gaussian Splatting (3DGS) environment for future-aware trajectory probing and reward shaping in E2E driving. GSDrive first learns a multi-mode trajectory probe via IL and then uses RL to evaluate multiple candidate futures in the 3DGS environment, converting their simulated…
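The contrast the abstract draws between delayed, event-based rewards and rewards obtained by probing several candidate futures can be illustrated with a small sketch. This is not the authors' implementation: the abstract is truncated before it explains how simulated outcomes become rewards, so everything below is assumed for illustration. A 2D point-obstacle world stands in for the 3DGS environment, and the names `sparse_reward`, `dense_reward`, `probe`, the clearance-based safety term, and the constants `COLLISION_RADIUS`, `SAFE_MARGIN`, and `PROGRESS_WEIGHT` are all hypothetical.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float]

# Hypothetical constants chosen for this illustration only.
COLLISION_RADIUS = 1.0   # closer than this to an obstacle counts as a collision
SAFE_MARGIN = 3.0        # clearance at or beyond this incurs no safety penalty
PROGRESS_WEIGHT = 0.1    # weight on the distance-to-goal term


def min_clearance(trajectory: Sequence[Point], obstacles: Sequence[Point]) -> float:
    """Smallest distance between any trajectory point and any obstacle."""
    return min(math.dist(p, o) for p in trajectory for o in obstacles)


def sparse_reward(trajectory: Sequence[Point], obstacles: Sequence[Point]) -> float:
    """Event-based signal: -1 on collision, otherwise 0."""
    return -1.0 if min_clearance(trajectory, obstacles) < COLLISION_RADIUS else 0.0


def dense_reward(trajectory: Sequence[Point], obstacles: Sequence[Point],
                 goal: Point) -> float:
    """Shaped signal: a graded safety penalty plus a progress term."""
    clearance = min_clearance(trajectory, obstacles)
    if clearance >= SAFE_MARGIN:
        safety = 0.0
    elif clearance <= COLLISION_RADIUS:
        safety = -1.0
    else:
        # Linear ramp from 0 (at SAFE_MARGIN) down to -1 (at COLLISION_RADIUS).
        safety = -(SAFE_MARGIN - clearance) / (SAFE_MARGIN - COLLISION_RADIUS)
    progress = -math.dist(trajectory[-1], goal)  # ending nearer the goal is better
    return safety + PROGRESS_WEIGHT * progress


def probe(candidates: Sequence[Sequence[Point]], obstacles: Sequence[Point],
          goal: Point) -> Tuple[List[float], int]:
    """Score every candidate future; return all rewards and the best index."""
    rewards = [dense_reward(t, obstacles, goal) for t in candidates]
    best = max(range(len(rewards)), key=rewards.__getitem__)
    return rewards, best


# Three candidate modes: drive straight, small lateral offset, wide lateral offset.
candidates = [[(float(x), y) for x in range(0, 21, 2)] for y in (0.0, 2.0, 4.0)]
obstacles = [(10.0, 0.0)]
goal = (20.0, 0.0)

rewards, best = probe(candidates, obstacles, goal)
print("sparse:", [sparse_reward(t, obstacles) for t in candidates])
print("dense: ", rewards, "best mode:", best)
```

In this toy setup the sparse signal assigns the same value to both non-colliding candidates, while the shaped signal separates a near miss from a comfortable pass. That illustrates why probing multiple futures can give a policy feedback before a collision ever occurs.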

Code & Models

Repositories

ZionGo6/GSDrive (GitHub)
