Formally Verified Approximate Policy Iteration

Maximilian Sch\"affeler; Mohammad Abdulaziz

arXiv:2406.07340·cs.AI·June 12, 2024

Formally Verified Approximate Policy Iteration

Maximilian Sch\"affeler, Mohammad Abdulaziz

PDF

Open Access 1 Video

TL;DR

This paper presents a formally verified approximate policy iteration algorithm for Factored Markov Decision Processes, including a verified implementation and software for certifying Linear Programming solutions, demonstrating practical applicability.

Contribution

It introduces a formally verified algorithm for approximate policy iteration and develops verified software for Linear Programming certification, advancing formal methods in decision process algorithms.

Findings

01

Verified algorithm successfully applied to benchmark problems

02

Implementation demonstrates practical viability

03

Enhanced formal methods for interactive theorem proving

Abstract

We formally verify an algorithm for approximate policy iteration on Factored Markov Decision Processes using the interactive theorem prover Isabelle/HOL. Next, we show how the formalized algorithm can be refined to an executable, verified implementation. The implementation is evaluated on benchmark problems to show its practicability. As part of the refinement, we develop verified software to certify Linear Programming solutions. The algorithm builds on a diverse library of formalized mathematics and pushes existing methodologies for interactive theorem provers to the limits. We discuss the process of the verification project and the modifications to the algorithm needed for formal verification.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Formally Verified Approximate Policy Iteration· underline

Taxonomy

TopicsOptimization and Search Problems · Age of Information Optimization

MethodsLib