Good-for-MDP State Reduction for Stochastic LTL Planning
Christoph Weinhuber, Giuseppe De Giacomo, Yong Li, Sven Schewe, Qiyi Tang

TL;DR
This paper introduces a novel state-space reduction technique for good-for-MDP automata used in stochastic LTL planning, significantly improving scalability and efficiency in policy synthesis.
Contribution
We propose a new GFM automata state reduction method leveraging recent minimisation advances and a specialized construction for G F formulas, enhancing scalability in stochastic LTL planning.
Findings
Automata state reduction leads to more scalable planning.
Specialized G F formula construction is single-exponential.
Empirical results confirm improved scalability.
Abstract
We study stochastic planning problems in Markov Decision Processes (MDPs) with goals specified in Linear Temporal Logic (LTL). The state-of-the-art approach transforms LTL formulas into good-for-MDP (GFM) automata, which feature a restricted form of nondeterminism. These automata are then composed with the MDP, allowing the agent to resolve the nondeterminism during policy synthesis. A major factor affecting the scalability of this approach is the size of the generated automata. In this paper, we propose a novel GFM state-space reduction technique that significantly reduces the number of automata states. Our method employs a sophisticated chain of transformations, leveraging recent advances in good-for-games minimisation developed for adversarial settings. In addition to our theoretical contributions, we present empirical results demonstrating the practical effectiveness of our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsFormal Methods in Verification · Reinforcement Learning in Robotics · Logic, Reasoning, and Knowledge
