Safety-Aware Apprenticeship Learning

Weichao Zhou; Wenchao Li

arXiv:1710.07983 · cs.AI · May 1, 2018 · 1 cites

TL;DR

This paper introduces a safety-aware apprenticeship learning method that integrates probabilistic model checking to ensure safety properties are maintained during policy learning from demonstrations.

Contribution

It presents a novel counterexample-guided approach embedding probabilistic model checking into apprenticeship learning to guarantee safety without sacrificing learning performance.
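
To make the model-checking step concrete, here is a minimal sketch of the kind of question a probabilistic model checker answers once a policy is fixed: does the probability of ever reaching an unsafe state stay below a threshold? The Markov chain, the state names, and the `reach_probability` and `satisfies_bound` helpers are hypothetical and only illustrate the idea; this is not the authors' implementation, and a real model checker handles far richer properties.

```python
from typing import Dict, Set

# Hypothetical Markov chain induced by fixing a policy in an MDP:
# chain[s][t] is the probability of moving from state s to state t.
CHAIN: Dict[str, Dict[str, float]] = {
    "s0": {"s1": 0.9, "unsafe": 0.1},
    "s1": {"goal": 0.8, "s0": 0.2},
    "goal": {"goal": 1.0},
    "unsafe": {"unsafe": 1.0},
}


def reach_probability(chain: Dict[str, Dict[str, float]],
                      unsafe: Set[str],
                      start: str,
                      iterations: int = 1000) -> float:
    """Approximate the probability of eventually reaching an unsafe state.

    Iterates the reachability equations starting from zero, which converges
    to the least fixed point, i.e. the reachability probability.
    """
    prob = {s: (1.0 if s in unsafe else 0.0) for s in chain}
    for _ in range(iterations):
        for s in chain:
            if s in unsafe:
                continue  # unsafe states keep probability 1
            prob[s] = sum(p * prob[t] for t, p in chain[s].items())
    return prob[start]


def satisfies_bound(chain: Dict[str, Dict[str, float]],
                    unsafe: Set[str],
                    start: str,
                    bound: float) -> bool:
    """Check a reachability property of the form P<=bound [ F unsafe ]."""
    return reach_probability(chain, unsafe, start) <= bound


p_unsafe = reach_probability(CHAIN, {"unsafe"}, "s0")
print(f"P(reach unsafe from s0) = {p_unsafe:.4f}")
print("P<=0.2 holds:", satisfies_bound(CHAIN, {"unsafe"}, "s0", 0.2))
```

In a counterexample-guided setting, a policy that fails such a check would yield a counterexample (for instance, a set of paths carrying too much probability mass into unsafe states) that could then inform the next learning step.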

Findings

01. Successfully ensures safety in apprenticeship learning scenarios

02. Retains high policy performance while enforcing safety constraints

03. Effective in complex, safety-critical environments

Abstract

Apprenticeship learning (AL) is a class of Learning from Demonstration techniques in which the reward function of a Markov Decision Process (MDP) is unknown to the learning agent, and the agent has to derive a good policy by observing an expert's demonstrations. In this paper, we study the problem of how to make AL algorithms inherently safe while still meeting their learning objective. We consider a setting where the unknown reward function is assumed to be a linear combination of a set of state features, and the safety property is specified in Probabilistic Computation Tree Logic (PCTL). By embedding probabilistic model checking inside AL, we propose a novel counterexample-guided approach that can ensure safety while retaining performance of the learnt policy. We demonstrate the effectiveness of our approach on several challenging AL scenarios where safety is essential.
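
The linear-reward assumption in the abstract can be illustrated with a small sketch. If the reward is a weighted sum of state features, then the discounted return of a set of demonstrations depends on them only through their discounted feature expectations. The states, feature vectors, weights, and helper names below are hypothetical and chosen purely for illustration; they are not taken from the paper.

```python
from typing import Dict, List, Sequence

# Hypothetical feature map: each state is described by two features.
FEATURES: Dict[str, List[float]] = {
    "start": [1.0, 0.0],
    "road": [0.0, 1.0],
    "goal": [1.0, 1.0],
}


def reward(weights: Sequence[float], state: str) -> float:
    """Linear reward: a weighted sum of the state's features."""
    return sum(w * f for w, f in zip(weights, FEATURES[state]))


def feature_expectation(trajectories: Sequence[Sequence[str]],
                        gamma: float) -> List[float]:
    """Average discounted feature counts over the given trajectories."""
    dim = len(next(iter(FEATURES.values())))
    mu = [0.0] * dim
    for trajectory in trajectories:
        for t, state in enumerate(trajectory):
            for i, f in enumerate(FEATURES[state]):
                mu[i] += (gamma ** t) * f
    return [m / len(trajectories) for m in mu]


# Two hypothetical expert demonstrations.
demos = [["start", "road", "goal"], ["start", "road", "road"]]
mu_expert = feature_expectation(demos, gamma=0.5)

# With a linear reward, the average discounted return of the demonstrations
# equals the dot product of the weights with the feature expectations.
weights = [0.5, -1.0]
expected_return = sum(w * m for w, m in zip(weights, mu_expert))
print(mu_expert, expected_return)
```

Because the return is linear in the feature expectations, a learner that matches the expert's feature expectations also matches the expert's return for any weight vector, even though the weights themselves are unknown.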

Code & Models

Repositories

zwc662/CAV2018 (Official)

Taxonomy

Topics: Machine Learning and Algorithms · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning