Found-RL: foundation model-enhanced reinforcement learning for autonomous driving
Yansong Qu, Zihao Sheng, Zilin Huang, Jiancong Chen, Yuhao Luo, Tianyi Wang, Yiheng Feng, Samuel Labi, Sikai Chen

TL;DR
Found-RL enhances reinforcement learning for autonomous driving by integrating foundation models with an asynchronous inference framework, enabling real-time, context-aware decision-making with high efficiency and interpretability.
Contribution
The paper introduces Found-RL, a platform that efficiently combines foundation models with RL for autonomous driving, featuring an asynchronous inference framework and novel supervision mechanisms.
Findings
Lightweight RL models achieve near-VLM performance.
Real-time inference at approximately 500 FPS.
Effective dense reward shaping using CLIP.
Abstract
Reinforcement Learning (RL) has emerged as a dominant paradigm for end-to-end autonomous driving (AD). However, RL suffers from sample inefficiency and a lack of semantic interpretability in complex scenarios. Foundation Models, particularly Vision-Language Models (VLMs), can mitigate this by offering rich, context-aware knowledge, yet their high inference latency hinders deployment in high-frequency RL training loops. To bridge this gap, we present Found-RL, a platform tailored to efficiently enhance RL for AD using foundation models. A core innovation is the asynchronous batch inference framework, which decouples heavy VLM reasoning from the simulation loop, effectively resolving latency bottlenecks to support real-time learning. We introduce diverse supervision mechanisms: Value-Margin Regularization (VMR) and Advantage-Weighted Action Guidance (AWAG) to effectively distill…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Multimodal Machine Learning Applications
