Reinforcement Learning with Human Feedback for Realistic Traffic   Simulation

Yulong Cao; Boris Ivanovic; Chaowei Xiao; Marco Pavone

arXiv:2309.00709·cs.AI·September 6, 2023

Reinforcement Learning with Human Feedback for Realistic Traffic Simulation

Yulong Cao, Boris Ivanovic, Chaowei Xiao, Marco Pavone

PDF

Open Access

TL;DR

This paper introduces TrafficRLHF, a reinforcement learning framework that uses human feedback to improve the realism of traffic simulations for autonomous vehicle testing, addressing the challenge of aligning models with human preferences.

Contribution

It presents the first dataset for traffic realism alignment and demonstrates a novel RLHF-based approach to generate human-aligned realistic traffic scenarios.

Findings

01

TrafficRLHF produces traffic scenarios aligned with human preferences.

02

The framework outperforms baseline models in realism evaluation.

03

The new dataset supports future research in traffic realism modeling.

Abstract

In light of the challenges and costs of real-world testing, autonomous vehicle developers often rely on testing in simulation for the creation of reliable systems. A key element of effective simulation is the incorporation of realistic traffic models that align with human knowledge, an aspect that has proven challenging due to the need to balance realism and diversity. This works aims to address this by developing a framework that employs reinforcement learning with human preference (RLHF) to enhance the realism of existing traffic models. This study also identifies two main challenges: capturing the nuances of human preferences on realism and the unification of diverse traffic simulation models. To tackle these issues, we propose using human feedback for alignment and employ RLHF due to its sample efficiency. We also introduce the first dataset for realism alignment in traffic modeling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management · Autonomous Vehicle Technology and Safety · Human-Automation Interaction and Safety

MethodsALIGN