HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving

Zhiwen Chen; Bo Leng; Zhuoren Li; Hanming Deng; Guizhe Jin; Ran Yu; Huanxi Wen

arXiv:2505.15793·cs.RO·May 23, 2025

HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving

Zhiwen Chen, Bo Leng, Zhuoren Li, Hanming Deng, Guizhe Jin, Ran Yu, Huanxi Wen

PDF

Open Access

TL;DR

This paper introduces HCRMP, a novel framework that combines LLM-generated semantic hints with reinforcement learning to improve autonomous driving performance and safety in complex scenarios.

Contribution

It proposes a new LLM-Hinted RL paradigm and the HCRMP architecture, which effectively integrates semantic hints with RL to enhance motion planning and safety.

Findings

01

Achieves up to 80.3% task success rate in diverse conditions.

02

Reduces collision rate by 11.4% in safety-critical scenarios.

03

Demonstrates strong driving performance in CARLA simulations.

Abstract

Integrating Large Language Models (LLMs) with Reinforcement Learning (RL) can enhance autonomous driving (AD) performance in complex scenarios. However, current LLM-Dominated RL methods over-rely on LLM outputs, which are prone to hallucinations. Evaluations show that state-of-the-art LLM indicates a non-hallucination rate of only approximately 57.95% when assessed on essential driving-related tasks. Thus, in these methods, hallucinations from the LLM can directly jeopardize the performance of driving policies. This paper argues that maintaining relative independence between the LLM and the RL is vital for solving the hallucinations problem. Consequently, this paper is devoted to propose a novel LLM-Hinted RL paradigm. The LLM is used to generate semantic hints for state augmentation and policy optimization to assist RL agent in motion planning, while the RL agent counteracts potential…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Transportation and Mobility Innovations

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator