Verifiable Reinforcement Learning Systems via Compositionality

Cyrus Neary; Aryaman Singh Samyal; Christos Verginis; Murat Cubuktepe,; Ufuk Topcu

arXiv:2309.06420·eess.SY·September 13, 2023

Verifiable Reinforcement Learning Systems via Compositionality

Cyrus Neary, Aryaman Singh Samyal, Christos Verginis, Murat Cubuktepe,, Ufuk Topcu

PDF

Open Access

TL;DR

This paper introduces a compositional framework for verifiable reinforcement learning, enabling independent training of subsystems with guarantees on overall task satisfaction through a high-level planning model.

Contribution

It presents a novel framework combining high-level planning with low-level deep RL subsystems, ensuring verifiability and compositionality in complex environments.

Findings

01

Framework guarantees overall task satisfaction if subsystems meet their specifications.

02

Automatic decomposition of task specifications into subtask requirements.

03

Experimental validation across diverse environments with partial observability.

Abstract

We propose a framework for verifiable and compositional reinforcement learning (RL) in which a collection of RL subsystems, each of which learns to accomplish a separate subtask, are composed to achieve an overall task. The framework consists of a high-level model, represented as a parametric Markov decision process, which is used to plan and analyze compositions of subsystems, and of the collection of low-level subsystems themselves. The subsystems are implemented as deep RL agents operating under partial observability. By defining interfaces between the subsystems, the framework enables automatic decompositions of task specifications, e.g., reach a target set of states with a probability of at least 0.95, into individual subtask specifications, i.e. achieve the subsystem's exit conditions with at least some minimum probability, given that its entry conditions are met. This in turn…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning