Incremental Pruning: A Simple, Fast, Exact Method for Partially   Observable Markov Decision Processes

Anthony R. Cassandra; Michael L. Littman; Nevin Lianwen Zhang

arXiv:1302.1525·cs.AI·February 8, 2013·346 cites

Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes

Anthony R. Cassandra, Michael L. Littman, Nevin Lianwen Zhang

PDF

Open Access

TL;DR

Incremental pruning is presented as the most efficient exact algorithm for solving POMDPs, offering a simple and fast approach based on dynamic programming with piecewise-linear convex value functions.

Contribution

The paper introduces and evaluates variations of the incremental pruning method, demonstrating its superior efficiency over previous algorithms for POMDPs.

Findings

01

Incremental pruning outperforms earlier algorithms in efficiency.

02

It is the most effective exact method for solving POMDPs.

03

The method simplifies the dynamic programming process for POMDPs.

Abstract

Most exact algorithms for general partially observable Markov decision processes (POMDPs) use a form of dynamic programming in which a piecewise-linear and convex representation of one value function is transformed into another. We examine variations of the "incremental pruning" method for solving this problem and compare them to earlier algorithms from theoretical and empirical perspectives. We find that incremental pruning is presently the most efficient exact method for solving POMDPs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Reinforcement Learning in Robotics · Machine Learning and Algorithms