Assume-Guarantee Reinforcement Learning

Milad Kazemi; Mateo Perez; Fabio Somenzi; Sadegh Soudjani; Ashutosh; Trivedi; Alvaro Velasquez

arXiv:2312.09938·cs.LG·December 18, 2023·1 cites

Assume-Guarantee Reinforcement Learning

Milad Kazemi, Mateo Perez, Fabio Somenzi, Sadegh Soudjani, Ashutosh, Trivedi, Alvaro Velasquez

PDF

Open Access

TL;DR

This paper introduces a modular reinforcement learning framework using assume-guarantee contracts expressed as regular languages, enabling scalable learning in complex environments by synthesizing local controllers with guarantees on system-wide satisfaction probabilities.

Contribution

It proposes a novel assume-guarantee paradigm for RL, translating contracts into rewards and providing a method to compute lower bounds on system satisfaction probabilities.

Findings

01

Efficiently synthesizes local controllers in modular environments.

02

Provides a lower bound on the probability of system satisfaction.

03

Demonstrates effectiveness on various case studies.

Abstract

We present a modular approach to \emph{reinforcement learning} (RL) in environments consisting of simpler components evolving in parallel. A monolithic view of such modular environments may be prohibitively large to learn, or may require unrealizable communication between the components in the form of a centralized controller. Our proposed approach is based on the assume-guarantee paradigm where the optimal control for the individual components is synthesized in isolation by making \emph{assumptions} about the behaviors of neighboring components, and providing \emph{guarantees} about their own behavior. We express these \emph{assume-guarantee contracts} as regular languages and provide automatic translations to scalar rewards to be used in RL. By combining local probabilities of satisfaction for each component, we provide a lower bound on the probability of satisfaction of the complete…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Computability, Logic, AI Algorithms