Probabilistic Verification of Recurrent Neural Networks for Single and Multi-Agent Reinforcement Learning

Luca Marzari; Enrico Marchesini

arXiv:2605.14758·cs.AI·May 15, 2026

Probabilistic Verification of Recurrent Neural Networks for Single and Multi-Agent Reinforcement Learning

Luca Marzari, Enrico Marchesini

PDF

TL;DR

RNN-ProVe is a probabilistic verification framework that estimates the likelihood of undesirable behaviors in RNN-based policies for partially observable reinforcement learning, providing scalable, high-confidence guarantees.

Contribution

It introduces a novel probabilistic approach that overcomes limitations of existing tools by using policy-driven sampling and statistical bounds for RNN verification in complex RL settings.

Findings

01

Provides more quantitative probabilistic guarantees than existing methods.

02

Scales effectively to recurrent and multi-agent reinforcement learning tasks.

03

Offers bounded-error, high-confidence estimates of behavioral violations.

Abstract

History-dependent policies induced by recurrent neural networks (RNNs) rely on latent hidden state dynamics, making verification in partially observable reinforcement learning (RL) challenging. Existing RNN verification tools typically rely on restrictive modeling assumptions or coarse over-approximations of the hidden state space, which can lead to overly conservative or inconclusive results. We propose $RNN$ $Pro$ babilistic $Ve$ rification ( $RNN-ProVe$ ), a probabilistic framework that $estimates the likelihood$ of undesired behaviors in RNN-based policies. $RNN-ProVe$ uses policy-driven sampling to approximate the set of hidden states that are feasible under a trained policy, and derives statistical error bounds to produce bounded-error, high-confidence estimates of behavioral violations. Experiments on partially observable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.