Safety-Aware Apprenticeship Learning
Weichao Zhou, Wenchao Li

TL;DR
This paper introduces a safety-aware apprenticeship learning method that integrates probabilistic model checking to ensure safety properties are maintained during policy learning from demonstrations.
Contribution
It presents a novel counterexample-guided approach embedding probabilistic model checking into apprenticeship learning to guarantee safety without sacrificing learning performance.
Findings
Successfully ensures safety in apprenticeship learning scenarios
Retains high policy performance while enforcing safety constraints
Effective in complex, safety-critical environments
Abstract
Apprenticeship learning (AL) is a kind of Learning from Demonstration techniques where the reward function of a Markov Decision Process (MDP) is unknown to the learning agent and the agent has to derive a good policy by observing an expert's demonstrations. In this paper, we study the problem of how to make AL algorithms inherently safe while still meeting its learning objective. We consider a setting where the unknown reward function is assumed to be a linear combination of a set of state features, and the safety property is specified in Probabilistic Computation Tree Logic (PCTL). By embedding probabilistic model checking inside AL, we propose a novel counterexample-guided approach that can ensure safety while retaining performance of the learnt policy. We demonstrate the effectiveness of our approach on several challenging AL scenarios where safety is essential.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning
