Policy Verification in Stochastic Dynamical Systems Using Logarithmic Neural Certificates
Thom Badings, Wietze Koops, Sebastian Junges, Nils Jansen

TL;DR
This paper introduces a novel approach using logarithmic neural certificates to verify neural network policies in stochastic systems, enabling high-probability reach-avoid guarantees with improved Lipschitz constant bounds.
Contribution
It proposes logarithmic RASMs and a fast method for tighter Lipschitz bounds, improving verification of stochastic policies over existing methods.
Findings
Successfully verified reach-avoid specifications with probabilities up to 99.9999%
Logarithmic RASMs have exponentially smaller values and lower Lipschitz constants
Tighter Lipschitz bounds improve verification accuracy and efficiency
Abstract
We consider the verification of neural network policies for discrete-time stochastic systems with respect to reach-avoid specifications. We use a learner-verifier procedure that learns a certificate for the specification, represented as a neural network. Verifying that this neural network certificate is a so-called reach-avoid supermartingale (RASM) proves the satisfaction of a reach-avoid specification. Existing approaches for such a verification task rely on computed Lipschitz constants of neural networks. These approaches struggle with large Lipschitz constants, especially for reach-avoid specifications with high threshold probabilities. We present two key contributions to obtain smaller Lipschitz constants than existing approaches. First, we introduce logarithmic RASMs (logRASMs), which take exponentially smaller values than RASMs and hence have lower theoretical Lipschitz…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
