Adaptive Discretization using Voronoi Trees for Continuous-Action POMDPs

Marcus Hoerger; Hanna Kurniawati; Dirk Kroese; Nan Ye

arXiv:2209.05733·cs.AI·September 14, 2022·1 cites

Adaptive Discretization using Voronoi Trees for Continuous-Action POMDPs

Marcus Hoerger, Hanna Kurniawati, Dirk Kroese, Nan Ye

PDF

Open Access 1 Repo

TL;DR

This paper introduces ADVT, an adaptive discretization method using Voronoi trees for solving high-dimensional continuous-action POMDPs efficiently by combining Monte Carlo Tree Search with hierarchical action space partitioning.

Contribution

It proposes a novel Voronoi tree-based adaptive discretization technique integrated with Monte Carlo Tree Search for continuous-action POMDPs, improving scalability and solution quality.

Findings

01

ADVT outperforms existing solvers on benchmark problems.

02

It scales better to high-dimensional action spaces.

03

It achieves more efficient and adaptive action space discretization.

Abstract

Solving Partially Observable Markov Decision Processes (POMDPs) with continuous actions is challenging, particularly for high-dimensional action spaces. To alleviate this difficulty, we propose a new sampling-based online POMDP solver, called Adaptive Discretization using Voronoi Trees (ADVT). It uses Monte Carlo Tree Search in combination with an adaptive discretization of the action space as well as optimistic optimization to efficiently sample high-dimensional continuous action spaces and compute the best action to perform. Specifically, we adaptively discretize the action space for each sampled belief using a hierarchical partition which we call a Voronoi tree. A Voronoi tree is a Binary Space Partitioning (BSP) that implicitly maintains the partition of a cell as the Voronoi diagram of two points sampled from the cell. This partitioning strategy keeps the cost of partitioning and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hoergems/advt
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Machine Learning and Algorithms · Markov Chains and Monte Carlo Methods