Formally Verified Solution Methods for Infinite-Horizon Markov Decision   Processes

Maximilian Sch\"afeller; Mohammad Abdulaziz

arXiv:2206.02169·cs.AI·March 9, 2023·1 cites

Formally Verified Solution Methods for Infinite-Horizon Markov Decision Processes

Maximilian Sch\"afeller, Mohammad Abdulaziz

PDF

Open Access

TL;DR

This paper presents formally verified algorithms for solving infinite-horizon Markov decision processes using Isabelle/HOL, ensuring correctness and practical efficiency in comparison to existing systems.

Contribution

It introduces formally verified dynamic programming algorithms for infinite-horizon MDPs within a theorem prover, combining formal correctness with practical performance.

Findings

01

Verified algorithms solve standard MDP problems effectively

02

Formalization ensures correctness of the Bellman equation and policies

03

System can outperform some state-of-the-art solutions

Abstract

We formally verify executable algorithms for solving Markov decision processes (MDPs) in the interactive theorem prover Isabelle/HOL. We build on existing formalizations of probability theory to analyze the expected total reward criterion on infinite-horizon problems. Our developments formalize the Bellman equation and give conditions under which optimal policies exist. Based on this analysis, we verify dynamic programming algorithms to solve tabular MDPs. We evaluate the formally verified implementations experimentally on standard problems and show they are practical. Furthermore, we show that, combined with efficient unverified implementations, our system can compete with and even outperform state-of-the-art systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLogic, Reasoning, and Knowledge · Formal Methods in Verification · Bayesian Modeling and Causal Inference