DESPOT: Online POMDP Planning with Regularization

Nan Ye; Adhiraj Somani; David Hsu; Wee Sun Lee

arXiv:1609.03250·cs.AI·September 20, 2017

DESPOT: Online POMDP Planning with Regularization

Nan Ye, Adhiraj Somani, David Hsu, Wee Sun Lee

PDF

1 Repo

TL;DR

The paper introduces DESPOT, an online POMDP planning algorithm that uses regularization to balance policy complexity and performance, enabling efficient real-time decision making in uncertain environments.

Contribution

It proposes a novel sparse belief tree approximation called DESPOT and an anytime planning algorithm with regularization to improve online POMDP solutions.

Findings

01

Strong experimental performance compared to existing algorithms

02

Effective regularization prevents overfitting in policy search

03

Successfully integrated into autonomous driving for real-time control

Abstract

The partially observable Markov decision process (POMDP) provides a principled general framework for planning under uncertainty, but solving POMDPs optimally is computationally intractable, due to the "curse of dimensionality" and the "curse of history". To overcome these challenges, we introduce the Determinized Sparse Partially Observable Tree (DESPOT), a sparse approximation of the standard belief tree, for online planning under uncertainty. A DESPOT focuses online planning on a set of randomly sampled scenarios and compactly captures the "execution" of all policies under these scenarios. We show that the best policy obtained from a DESPOT is near-optimal, with a regret bound that depends on the representation size of the optimal policy. Leveraging this result, we give an anytime online planning algorithm, which searches a DESPOT for a policy that optimizes a regularized objective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AdaCompNUS/despot
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.