An Abstraction-based Method to Check Multi-Agent Deep Reinforcement-Learning Behaviors
Pierre El Mqirmi, Francesco Belardinelli, Borja G. Le\'on

TL;DR
This paper introduces an abstraction-based approach that combines formal verification with deep reinforcement learning to ensure agents adhere to safety constraints in multi-agent systems, applicable during training and testing.
Contribution
It presents a novel method that uses formal verification and abstract modeling to guarantee safety constraints in multi-agent deep RL, with automatic model generation and formal guarantees.
Findings
Effective in ensuring safety constraints in multi-agent environments
Reduces verification complexity through abstraction
Provides formal guarantees of safety compliance
Abstract
Multi-agent reinforcement learning (RL) often struggles to ensure the safe behaviours of the learning agents, and therefore it is generally not adapted to safety-critical applications. To address this issue, we present a methodology that combines formal verification with (deep) RL algorithms to guarantee the satisfaction of formally-specified safety constraints both in training and testing. The approach we propose expresses the constraints to verify in Probabilistic Computation Tree Logic (PCTL) and builds an abstract representation of the system to reduce the complexity of the verification step. This abstract model allows for model checking techniques to identify a set of abstract policies that meet the safety constraints expressed in PCTL. Then, the agents' behaviours are restricted according to these safe abstract policies. We provide formal guarantees that by using this method, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFormal Methods in Verification · Reinforcement Learning in Robotics · Software Testing and Debugging Techniques
