Verifiable and Compositional Reinforcement Learning Systems

Cyrus Neary; Christos Verginis; Murat Cubuktepe; Ufuk Topcu

arXiv:2106.05864·cs.LG·May 16, 2022

Verifiable and Compositional Reinforcement Learning Systems

Cyrus Neary, Christos Verginis, Murat Cubuktepe, Ufuk Topcu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a framework for verifiable, compositional reinforcement learning that enables independent training of subsystems with guarantees on overall task satisfaction through formal interfaces and iterative specification refinement.

Contribution

The framework combines a high-level pMDP model with low-level RL subsystems, allowing automatic decomposition, independent training, and iterative adjustment of subtask specifications for compositional RL.

Findings

01

Guarantees overall task satisfaction if subsystems meet subtask specs.

02

Automatic decomposition of complex tasks into manageable subtasks.

03

Iterative refinement improves subsystem policies and task success.

Abstract

We propose a framework for verifiable and compositional reinforcement learning (RL) in which a collection of RL subsystems, each of which learns to accomplish a separate subtask, are composed to achieve an overall task. The framework consists of a high-level model, represented as a parametric Markov decision process (pMDP) which is used to plan and to analyze compositions of subsystems, and of the collection of low-level subsystems themselves. By defining interfaces between the subsystems, the framework enables automatic decompositions of task specifications, e.g., reach a target set of states with a probability of at least 0.95, into individual subtask specifications, i.e. achieve the subsystem's exit conditions with at least some minimum probability, given that its entry conditions are met. This in turn allows for the independent training and testing of the subsystems; if they each…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cyrusneary/verifiable-compositional-rl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics