An Abstraction-based Method to Check Multi-Agent Deep   Reinforcement-Learning Behaviors

Pierre El Mqirmi; Francesco Belardinelli; Borja G. Le\'on

arXiv:2102.01434·cs.MA·April 5, 2021·5 cites

An Abstraction-based Method to Check Multi-Agent Deep Reinforcement-Learning Behaviors

Pierre El Mqirmi, Francesco Belardinelli, Borja G. Le\'on

PDF

Open Access

TL;DR

This paper introduces an abstraction-based approach that combines formal verification with deep reinforcement learning to ensure agents adhere to safety constraints in multi-agent systems, applicable during training and testing.

Contribution

It presents a novel method that uses formal verification and abstract modeling to guarantee safety constraints in multi-agent deep RL, with automatic model generation and formal guarantees.

Findings

01

Effective in ensuring safety constraints in multi-agent environments

02

Reduces verification complexity through abstraction

03

Provides formal guarantees of safety compliance

Abstract

Multi-agent reinforcement learning (RL) often struggles to ensure the safe behaviours of the learning agents, and therefore it is generally not adapted to safety-critical applications. To address this issue, we present a methodology that combines formal verification with (deep) RL algorithms to guarantee the satisfaction of formally-specified safety constraints both in training and testing. The approach we propose expresses the constraints to verify in Probabilistic Computation Tree Logic (PCTL) and builds an abstract representation of the system to reduce the complexity of the verification step. This abstract model allows for model checking techniques to identify a set of abstract policies that meet the safety constraints expressed in PCTL. Then, the agents' behaviours are restricted according to these safe abstract policies. We provide formal guarantees that by using this method, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Reinforcement Learning in Robotics · Software Testing and Debugging Techniques