Reinforcement Learning with Iterative Reasoning for Merging in Dense   Traffic

Maxime Bouton; Alireza Nakhaei; David Isele; Kikuo Fujimura; and Mykel; J. Kochenderfer

arXiv:2005.11895·cs.AI·May 26, 2020

Reinforcement Learning with Iterative Reasoning for Merging in Dense Traffic

Maxime Bouton, Alireza Nakhaei, David Isele, Kikuo Fujimura, and Mykel, J. Kochenderfer

PDF

TL;DR

This paper introduces a novel approach combining reinforcement learning and game theory to improve autonomous vehicle merging in dense traffic by training with level-$k$ behavior to enhance robustness and efficiency.

Contribution

It proposes a new training curriculum using level-$k$ reasoning in reinforcement learning for autonomous merging in dense traffic, improving policy robustness.

Findings

01

Learned policies are more efficient than traditional methods.

02

Training with level-$k$ behavior enhances robustness to diverse traffic scenarios.

03

Approach outperforms baseline methods in simulation.

Abstract

Maneuvering in dense traffic is a challenging task for autonomous vehicles because it requires reasoning about the stochastic behaviors of many other participants. In addition, the agent must achieve the maneuver within a limited time and distance. In this work, we propose a combination of reinforcement learning and game theory to learn merging behaviors. We design a training curriculum for a reinforcement learning agent using the concept of level- $k$ behavior. This approach exposes the agent to a broad variety of behaviors during training, which promotes learning policies that are robust to model discrepancies. We show that our approach learns more efficient policies than traditional training methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.