Weakly Supervised Reinforcement Learning for Autonomous Highway Driving   via Virtual Safety Cages

Sampo Kuutti; Richard Bowden; Saber Fallah

arXiv:2103.09726·cs.LG·March 18, 2021

Weakly Supervised Reinforcement Learning for Autonomous Highway Driving via Virtual Safety Cages

Sampo Kuutti, Richard Bowden, Saber Fallah

PDF

1 Repo

TL;DR

This paper introduces a reinforcement learning approach for autonomous highway driving that uses rule-based safety cages as weak supervision, improving safety, convergence speed, and interpretability of the control policies.

Contribution

It proposes a novel weak supervision method using safety cages to guide reinforcement learning in autonomous driving, enhancing safety and interpretability.

Findings

01

Safety cages improve exploration safety.

02

Weak supervision accelerates training convergence.

03

Models with safety cages perform better under sub-optimal parameters.

Abstract

The use of neural networks and reinforcement learning has become increasingly popular in autonomous vehicle control. However, the opaqueness of the resulting control policies presents a significant barrier to deploying neural network-based control in autonomous vehicles. In this paper, we present a reinforcement learning based approach to autonomous vehicle longitudinal control, where the rule-based safety cages provide enhanced safety for the vehicle as well as weak supervision to the reinforcement learning agent. By guiding the agent to meaningful states and actions, this weak supervision improves the convergence during training and enhances the safety of the final trained policy. This rule-based supervisory controller has the further advantage of being fully interpretable, thereby enabling traditional validation and verification approaches to ensure the safety of the vehicle. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sampo-kuutti/weakly-supervised-rl-highway-driving
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.