Probabilistic Verification of Recurrent Neural Networks for Single and Multi-Agent Reinforcement Learning
Luca Marzari, Enrico Marchesini

TL;DR
RNN-ProVe is a probabilistic verification framework that estimates the likelihood of undesirable behaviors in RNN-based policies for partially observable reinforcement learning, providing scalable, high-confidence guarantees.
Contribution
It introduces a novel probabilistic approach that overcomes limitations of existing tools by using policy-driven sampling and statistical bounds for RNN verification in complex RL settings.
Findings
Provides more quantitative probabilistic guarantees than existing methods.
Scales effectively to recurrent and multi-agent reinforcement learning tasks.
Offers bounded-error, high-confidence estimates of behavioral violations.
Abstract
History-dependent policies induced by recurrent neural networks (RNNs) rely on latent hidden state dynamics, making verification in partially observable reinforcement learning (RL) challenging. Existing RNN verification tools typically rely on restrictive modeling assumptions or coarse over-approximations of the hidden state space, which can lead to overly conservative or inconclusive results. We propose babilistic rification (), a probabilistic framework that of undesired behaviors in RNN-based policies. uses policy-driven sampling to approximate the set of hidden states that are feasible under a trained policy, and derives statistical error bounds to produce bounded-error, high-confidence estimates of behavioral violations. Experiments on partially observable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
