Topological Value Iteration Algorithms
Peng Dai, Mausam, Daniel Sabby Weld, Judy Goldsmith

TL;DR
This paper introduces two novel topological value iteration algorithms, TVI and FTVI, which leverage structural analysis of MDPs to significantly improve solution efficiency over existing methods.
Contribution
The paper presents the first algorithms that utilize the graphical structure of MDPs for systematic decomposition and efficient solving, outperforming current state-of-the-art approaches.
Findings
FTVI outperforms TVI by an order of magnitude.
FTVI significantly outperforms popular heuristically-informed algorithms.
The algorithms excel in domains with multiple, close-to-equal-sized strongly-connected components.
Abstract
Value iteration is a powerful yet inefficient algorithm for Markov decision processes (MDPs) because it puts the majority of its effort into backing up the entire state space, which turns out to be unnecessary in many cases. In order to overcome this problem, many approaches have been proposed. Among them, ILAO* and variants of RTDP are state-of-the-art ones. These methods use reachability analysis and heuristic search to avoid some unnecessary backups. However, none of these approaches build the graphical structure of the state transitions in a pre-processing step or use the structural information to systematically decompose a problem, whereby generating an intelligent backup sequence of the state space. In this paper, we present two optimal MDP algorithms. The first algorithm, topological value iteration (TVI), detects the structure of MDPs and backs up states based on topological…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · AI-based Problem Solving and Planning · Formal Methods in Verification
