Reinforcement Learning based Control of Imitative Policies for   Near-Accident Driving

Zhangjie Cao; Erdem B{\i}y{\i}k; Woodrow Z. Wang; Allan Raventos,; Adrien Gaidon; Guy Rosman; Dorsa Sadigh

arXiv:2007.00178·cs.LG·July 2, 2020·5 cites

Reinforcement Learning based Control of Imitative Policies for Near-Accident Driving

Zhangjie Cao, Erdem B{\i}y{\i}k, Woodrow Z. Wang, Allan Raventos,, Adrien Gaidon, Guy Rosman, Dorsa Sadigh

PDF

Open Access 1 Repo

TL;DR

This paper introduces a hierarchical reinforcement and imitation learning framework for autonomous driving in near-accident scenarios, enabling safer and more efficient decision-making by switching between specialized driving modes.

Contribution

The paper presents a novel hierarchical approach combining RL and IL to better handle high-risk driving situations with rapid state changes.

Findings

01

Higher safety and efficiency in near-accident scenarios

02

Effective switching between driving modes demonstrated

03

Improved policy performance over existing methods

Abstract

Autonomous driving has achieved significant progress in recent years, but autonomous cars are still unable to tackle high-risk situations where a potential accident is likely. In such near-accident scenarios, even a minor change in the vehicle's actions may result in drastically different consequences. To avoid unsafe actions in near-accident scenarios, we need to fully explore the environment. However, reinforcement learning (RL) and imitation learning (IL), two widely-used policy learning methods, cannot model rapid phase transitions and are not scalable to fully cover all the states. To address driving in near-accident scenarios, we propose a hierarchical reinforcement and imitation learning (H-ReIL) approach that consists of low-level policies learned by IL for discrete driving modes, and a high-level policy learned by RL that switches between different driving modes. Our approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Stanford-ILIAD/CARLO
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Traffic control and management