Verifiable Reinforcement Learning Systems via Compositionality
Cyrus Neary, Aryaman Singh Samyal, Christos Verginis, Murat Cubuktepe,, Ufuk Topcu

TL;DR
This paper introduces a compositional framework for verifiable reinforcement learning, enabling independent training of subsystems with guarantees on overall task satisfaction through a high-level planning model.
Contribution
It presents a novel framework combining high-level planning with low-level deep RL subsystems, ensuring verifiability and compositionality in complex environments.
Findings
Framework guarantees overall task satisfaction if subsystems meet their specifications.
Automatic decomposition of task specifications into subtask requirements.
Experimental validation across diverse environments with partial observability.
Abstract
We propose a framework for verifiable and compositional reinforcement learning (RL) in which a collection of RL subsystems, each of which learns to accomplish a separate subtask, are composed to achieve an overall task. The framework consists of a high-level model, represented as a parametric Markov decision process, which is used to plan and analyze compositions of subsystems, and of the collection of low-level subsystems themselves. The subsystems are implemented as deep RL agents operating under partial observability. By defining interfaces between the subsystems, the framework enables automatic decompositions of task specifications, e.g., reach a target set of states with a probability of at least 0.95, into individual subtask specifications, i.e. achieve the subsystem's exit conditions with at least some minimum probability, given that its entry conditions are met. This in turn…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning
