Human-centric Reward Optimization for Reinforcement Learning-based Automated Driving using Large Language Models

Ziqi Zhou; Jingyue Zhang; Jingyuan Zhang; Yangfan He; Boyue Wang; Tianyu Shi; Alaa Khamis

arXiv:2405.04135·cs.AI·December 30, 2024·2 cites

TL;DR

This paper presents a novel method using large language models to optimize reinforcement learning rewards for automated driving, resulting in more human-like behavior and improved performance.

Contribution

Introduces a human-centric reward optimization framework leveraging LLMs to enhance RL-based automated driving agents.

Findings

01 LLMs can effectively generate rewards that promote human-like driving behavior.

02 Prompt design significantly influences the behavior of RL agents.

03 The approach improves both the anthropomorphism and the performance of automated driving systems.

Abstract

One of the key challenges in current Reinforcement Learning (RL)-based Automated Driving (AD) agents is achieving flexible, precise, and human-like behavior cost-effectively. This paper introduces an innovative approach that uses large language models (LLMs) to intuitively and effectively optimize RL reward functions in a human-centric way. We developed a framework where instructions and dynamic environment descriptions are input into the LLM. The LLM then utilizes this information to assist in generating rewards, thereby steering the behavior of RL agents towards patterns that more closely resemble human driving. The experimental results demonstrate that this approach not only makes RL agents more anthropomorphic but also achieves better performance. Additionally, various strategies for reward-proxy and reward-shaping are investigated, revealing the significant impact of prompt design…
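The loop the abstract describes (an instruction plus a description of the current scene goes to the LLM, and the reply is turned into a reward) can be sketched as follows. This is a minimal illustration and not the authors' implementation: the observation fields, the prompt wording, the 0/1 answer format, and every function name here are hypothetical. The `llm` argument is any callable that maps a prompt string to a reply string, and `fake_llm` is a rule-based stand-in so the sketch runs without a model. The two modes in `combined_reward` reflect one common reading of "reward proxy" (the LLM score replaces the environment reward) and "reward shaping" (the LLM score is added to it); the paper may define them differently.

```python
import re
from typing import Callable, Dict

# Hypothetical instruction; the paper's actual prompts are not shown in this listing.
INSTRUCTION = "You are judging whether a car is driving like a careful human."


def describe_state(obs: Dict[str, float]) -> str:
    """Turn a (hypothetical) numeric observation into a text scene description."""
    return (
        f"Ego speed: {obs['speed']:.1f} m/s. "
        f"Gap to lead vehicle: {obs['gap']:.1f} m. "
        f"Lane changes in the last 10 s: {int(obs['lane_changes'])}."
    )


def build_prompt(instruction: str, obs: Dict[str, float]) -> str:
    """Combine the fixed instruction with the dynamic scene description."""
    return (
        f"{instruction}\n\nScene: {describe_state(obs)}\n"
        "Answer with 1 (acceptable) or 0 (unacceptable)."
    )


def parse_score(reply: str) -> float:
    """Map the reply to a scalar; anything other than a leading '1' counts as 0."""
    return 1.0 if reply.strip()[:1] == "1" else 0.0


def llm_reward(llm: Callable[[str], str], instruction: str, obs: Dict[str, float]) -> float:
    """Query the LLM once for this state and return its score."""
    return parse_score(llm(build_prompt(instruction, obs)))


def combined_reward(env_reward: float, llm_score: float,
                    mode: str = "shaping", weight: float = 0.5) -> float:
    """Assumed definitions: 'proxy' uses the LLM score alone, 'shaping' adds it."""
    if mode == "proxy":
        return llm_score
    if mode == "shaping":
        return env_reward + weight * llm_score
    raise ValueError(f"unknown mode: {mode}")


def fake_llm(prompt: str) -> str:
    """Rule-based stand-in for a real model: approves gaps of at least 10 m."""
    match = re.search(r"Gap to lead vehicle: ([\d.]+) m", prompt)
    gap = float(match.group(1)) if match else 0.0
    return "1" if gap >= 10.0 else "0"
```

In an RL training loop, `combined_reward(env_reward, llm_reward(llm, INSTRUCTION, obs))` would stand in for the raw environment reward at each step, or at whatever interval the LLM is queried. Changing `INSTRUCTION` changes which behavior gets rewarded, which is consistent with the finding that prompt design shapes agent behavior.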

Code & Models

Repositories

jingyue2000/in-context_learning_for_automated_driving
none · Official

Taxonomy

Topics: Autonomous Vehicle Technology and Safety · Anomaly Detection Techniques and Applications · Reinforcement Learning in Robotics