Policies Grow on Trees: Model Checking Families of MDPs
Roman Andriushchenko, Milan \v{C}e\v{s}ka, Sebastian Junges, Filip, Mac\'ak

TL;DR
This paper presents a scalable method for synthesizing policies across families of MDPs using policy trees and game-based abstraction, significantly improving efficiency over naive approaches.
Contribution
It introduces policy trees and a recursive game-based abstraction for efficient synthesis over MDP families, with extensive empirical validation.
Findings
Successfully synthesized 246 policies covering 94 million MDPs
Achieved synthesis in less than 30 minutes for large families
Outperformed naive baselines by a large margin
Abstract
Markov decision processes (MDPs) provide a fundamental model for sequential decision making under process uncertainty. A classical synthesis task is to compute for a given MDP a winning policy that achieves a desired specification. However, at design time, one typically needs to consider a family of MDPs modelling various system variations. For a given family, we study synthesising (1) the subset of MDPs where a winning policy exists and (2) a preferably small number of winning policies that together cover this subset. We introduce policy trees that concisely capture the synthesis result. The key ingredient for synthesising policy trees is a recursive application of a game-based abstraction. We combine this abstraction with an efficient refinement procedure and a post-processing step. An extensive empirical evaluation demonstrates superior scalability of our approach compared to naive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Auction Theory and Applications · Model-Driven Software Engineering Techniques
