No Compromise in Solution Quality: Speeding Up Belief-dependent   Continuous POMDPs via Adaptive Multilevel Simplification

Andrey Zhitnikov; Ori Sztyglic; Vadim Indelman

arXiv:2310.10274·cs.AI·May 24, 2024·1 cites

No Compromise in Solution Quality: Speeding Up Belief-dependent Continuous POMDPs via Adaptive Multilevel Simplification

Andrey Zhitnikov, Ori Sztyglic, Vadim Indelman

PDF

Open Access

TL;DR

This paper introduces adaptive multilevel simplification techniques for continuous POMDPs with belief-dependent rewards, enabling faster online planning without sacrificing solution quality, supported by rigorous theory and practical algorithms.

Contribution

It presents a complete theoretical framework and three algorithms for accelerating belief-dependent continuous POMDP planning with provable guarantees and no loss in optimality.

Findings

01

Significant speedup in planning demonstrated in simulations

02

Guarantees of identical optimal actions with simplified methods

03

Novel adaptive bounds for information-theoretic rewards

Abstract

Continuous POMDPs with general belief-dependent rewards are notoriously difficult to solve online. In this paper, we present a complete provable theory of adaptive multilevel simplification for the setting of a given externally constructed belief tree and MCTS that constructs the belief tree on the fly using an exploration technique. Our theory allows to accelerate POMDP planning with belief-dependent rewards without any sacrifice in the quality of the obtained solution. We rigorously prove each theoretical claim in the proposed unified theory. Using the general theoretical results, we present three algorithms to accelerate continuous POMDP online planning with belief-dependent rewards. Our two algorithms, SITH-BSP and LAZY-SITH-BSP, can be utilized on top of any method that constructs a belief tree externally. The third algorithm, SITH-PFT, is an anytime MCTS method that permits to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · AI-based Problem Solving and Planning · Logic, Reasoning, and Knowledge