Are LLMs The Way Forward? A Case Study on LLM-Guided Reinforcement Learning for Decentralized Autonomous Driving

Timur Anvar; Jeffrey Chen; Yuyan Wang; Rohan Chandra

arXiv:2511.12751·cs.LG·November 18, 2025

TL;DR

This study evaluates whether small, locally deployed LLMs can enhance reinforcement learning for autonomous highway driving through reward shaping, and reports both benefits and notable limitations in safety-critical scenarios.

Contribution

The paper introduces a hybrid approach in which small LLMs augment RL rewards, compares it with RL-only and LLM-only methods, and analyzes the effectiveness and limitations of each in autonomous driving.
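
To make the idea of LLM-augmented rewards concrete, here is a minimal sketch of reward shaping in Python. It is an illustration only, not the authors' implementation: the function names (`shaped_reward`, `stub_llm_score`), the linear blending rule, and the `weight` parameter are all assumptions, and the stub stands in for a call to a locally deployed LLM that would score a textual description of the driving scene.

```python
def stub_llm_score(scene_description: str) -> float:
    """Hypothetical stand-in for a local LLM that rates a scene in [0, 1].

    A real system would prompt a model and parse its answer; this stub
    only looks for the word "collision" so the example runs offline.
    """
    return 0.0 if "collision" in scene_description.lower() else 1.0


def shaped_reward(env_reward: float, llm_score: float, weight: float = 0.5) -> float:
    """Blend the environment reward with an LLM score (assumed linear mix).

    The LLM score is clipped to [0, 1] so that a malformed or inconsistent
    model output cannot dominate the training signal.
    """
    clipped = min(1.0, max(0.0, llm_score))
    return (1.0 - weight) * env_reward + weight * clipped


# Example: the environment gives full reward, but the (stub) LLM flags risk.
reward = shaped_reward(1.0, stub_llm_score("near collision while merging"))
```

Clipping the LLM score is one simple way to limit the effect of inconsistent LLM outputs, which the abstract names as a drawback; other safeguards are possible.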

Findings

01 RL-only agents achieve 73-89% success rates

02 LLM-only agents reach up to 94% success but are slower

03 Hybrid approaches perform between RL-only and LLM-only methods

Abstract

Autonomous vehicle navigation in complex environments such as dense and fast-moving highways and merging scenarios remains an active area of research. A key limitation of reinforcement learning (RL) is its reliance on well-specified reward functions, which often fail to capture the full semantic and social complexity of diverse, out-of-distribution situations. As a result, a rapidly growing line of research explores using Large Language Models (LLMs) to replace or supplement RL for direct planning and control, on account of their ability to reason about rich semantic context. However, LLMs present significant drawbacks: they can be unstable in zero-shot safety-critical settings, produce inconsistent outputs, and often depend on expensive API calls with network latency. This motivates our investigation into whether small, locally deployed LLMs (< 14B parameters) can meaningfully support autonomous highway…

Taxonomy

Topics: Autonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics