Are LLMs The Way Forward? A Case Study on LLM-Guided Reinforcement Learning for Decentralized Autonomous Driving
Timur Anvar, Jeffrey Chen, Yuyan Wang, Rohan Chandra

TL;DR
This study evaluates the potential of small, locally deployed LLMs to enhance reinforcement learning for autonomous highway driving by reward shaping, revealing benefits and notable limitations in safety-critical scenarios.
Contribution
It introduces a hybrid approach where small LLMs augment RL rewards, compares it with RL-only and LLM-only methods, and analyzes their effectiveness and limitations in autonomous driving.
Findings
RL-only agents achieve 73-89% success rates
LLM-only agents reach up to 94% success but are slower
Hybrid approaches perform between RL-only and LLM-only methods
Abstract
Autonomous vehicle navigation in complex environments such as dense and fast-moving highways and merging scenarios remains an active area of research. A key limitation of RL is its reliance on well-specified reward functions, which often fail to capture the full semantic and social complexity of diverse, out-of-distribution situations. As a result, a rapidly growing line of research explores using Large Language Models (LLMs) to replace or supplement RL for direct planning and control, on account of their ability to reason about rich semantic context. However, LLMs present significant drawbacks: they can be unstable in zero-shot safety-critical settings, produce inconsistent outputs, and often depend on expensive API calls with network latency. This motivates our investigation into whether small, locally deployed LLMs (< 14B parameters) can meaningfully support autonomous highway…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics
