Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving

Shai Shalev-Shwartz; Shaked Shammah; Amnon Shashua

arXiv:1610.03295·cs.AI·October 12, 2016·367 cites

Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving

Shai Shalev-Shwartz, Shaked Shammah, Amnon Shashua

PDF

Open Access

TL;DR

This paper presents a deep reinforcement learning approach for autonomous driving in multi-agent urban environments, focusing on safety, long-term strategy formation, and reducing variance in policy gradient methods.

Contribution

It introduces a method to apply policy gradient iterations without Markovian assumptions, decomposes the problem into desires and constraints, and proposes an 'Option Graph' for hierarchical temporal abstraction.

Findings

01

Policy gradient can be used without Markov assumptions.

02

Decomposition into desires and constraints improves safety and comfort.

03

Hierarchical 'Option Graph' reduces gradient variance.

Abstract

Autonomous driving is a multi-agent setting where the host vehicle must apply sophisticated negotiation skills with other road users when overtaking, giving way, merging, taking left and right turns and while pushing ahead in unstructured urban roadways. Since there are many possible scenarios, manually tackling all possible cases will likely yield a too simplistic policy. Moreover, one must balance between unexpected behavior of other drivers/pedestrians and at the same time not to be too defensive so that normal traffic flow is maintained. In this paper we apply deep reinforcement learning to the problem of forming long term driving strategies. We note that there are two major challenges that make autonomous driving different from other robotic tasks. First, is the necessity for ensuring functional safety - something that machine learning has difficulty with given that performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Transportation and Mobility Innovations