Generating and Evolving Reward Functions for Highway Driving with Large Language Models

Xu Han; Qiannan Yang; Xianda Chen; Xiaowen Chu; Meixin Zhu

arXiv:2406.10540·cs.AI·June 18, 2024

Open Access

TL;DR

This paper presents a framework that uses Large Language Models to automatically generate and refine reward functions for autonomous highway driving, achieving a 22% higher success rate than handcrafted reward functions.

Contribution

Introduces a framework that uses LLMs to generate and evolve reward functions for autonomous driving, reducing manual design effort and improving performance.

Findings

01. Achieved a 22% higher success rate than handcrafted reward functions.

02. Demonstrated effective reward function generation in highway scenarios.

03. Improved development productivity for autonomous driving systems.

Abstract

Reinforcement Learning (RL) plays a crucial role in advancing autonomous driving technologies by maximizing reward functions to achieve the optimal policy. However, crafting these reward functions has often been a complex, manual process in practice. To reduce this complexity, we introduce a novel framework that integrates Large Language Models (LLMs) with RL to improve reward function design in autonomous driving. This framework utilizes the coding capabilities of LLMs, proven in other areas, to generate and evolve reward functions for highway scenarios. The framework starts by instructing LLMs to create initial reward function code based on the driving environment and task descriptions. This code is then refined through iterative cycles of RL training and LLM reflection, which benefits from the models' ability to review and improve their own output. We have also developed a specific…
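The generate, train, and reflect cycle described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: `stub_llm_propose` is a hypothetical stand-in for an LLM call, `stub_train_and_evaluate` is a hypothetical stand-in for RL training and evaluation in a simulator, and the observation keys `speed` and `collision` are assumptions made only for this example.

```python
import random
from typing import Callable, Dict, List, Tuple

Observation = Dict[str, float]
RewardFn = Callable[[Observation], float]

# Hypothetical (preferred, dispreferred) observation pairs for the stub evaluator.
PROBE_PAIRS = [
    ({"speed": 0.8, "collision": 0.0}, {"speed": 0.2, "collision": 0.0}),
    ({"speed": 0.5, "collision": 0.0}, {"speed": 1.0, "collision": 1.0}),
    ({"speed": 0.1, "collision": 0.0}, {"speed": 0.9, "collision": 1.0}),
]


def stub_llm_propose(feedback: str, rng: random.Random) -> str:
    """Stand-in for an LLM call: returns reward-function source code.

    A real system would send `feedback` to the model in a prompt; this stub
    ignores it and samples weights at random.
    """
    w_speed = round(rng.uniform(0.0, 1.0), 2)
    w_collision = round(rng.uniform(0.0, 1.0), 2)
    return (
        "def reward(obs):\n"
        f"    return {w_speed} * obs['speed'] - {w_collision} * obs['collision']\n"
    )


def compile_reward(source: str) -> RewardFn:
    """Turn generated source code into a callable reward function."""
    namespace: dict = {}
    exec(source, namespace)
    return namespace["reward"]


def stub_train_and_evaluate(reward_fn: RewardFn) -> float:
    """Stand-in for RL training: returns a proxy 'success rate' in [0, 1].

    It only checks how often the reward ranks the preferred observation higher.
    """
    hits = sum(1 for good, bad in PROBE_PAIRS if reward_fn(good) > reward_fn(bad))
    return hits / len(PROBE_PAIRS)


def evolve_reward(iterations: int = 3, seed: int = 0) -> Tuple[str, float, List[float]]:
    """Iteratively propose, evaluate, and keep the best reward function."""
    rng = random.Random(seed)
    feedback = "Initial task: drive safely and efficiently on a highway."
    best_source, best_score = "", -1.0
    history: List[float] = []
    for i in range(iterations):
        source = stub_llm_propose(feedback, rng)
        reward_fn = compile_reward(source)
        score = stub_train_and_evaluate(reward_fn)
        history.append(score)
        if score > best_score:
            best_source, best_score = source, score
        # The evaluation result becomes the reflection prompt for the next round.
        feedback = f"Iteration {i}: success rate {score:.2f}. Improve the reward."
    return best_source, best_score, history


if __name__ == "__main__":
    source, score, history = evolve_reward()
    print(source)
    print("best proxy success rate:", score, "history:", history)
```

In a real setup, the two stubs would be replaced by an actual LLM request and an RL training run, and generated code would need sandboxing and error handling before being executed.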

Taxonomy

Topics: Topic Modeling · Traffic Prediction and Management Techniques · Natural Language Processing Techniques