Anytime Probabilistically Constrained Provably Convergent Online Belief Space Planning
Andrey Zhitnikov, Vadim Indelman

TL;DR
This paper introduces an anytime Monte Carlo Tree Search method for online belief space planning that guarantees safety and improves action quality in continuous domains without requiring search convergence.
Contribution
It presents a novel anytime MCTS algorithm for belief space planning that ensures safety and convergence in probability, with extensive simulation validation.
Findings
The method guarantees safety at any time during search.
It outperforms baselines in safety and objective metrics.
The approach effectively revises search tree values after pruning.
Abstract
Taking into account future risk is essential for an autonomously operating robot to find online not only the best but also a safe action to execute. In this paper, we build upon the recently introduced formulation of probabilistic belief-dependent constraints. We present an anytime approach employing the Monte Carlo Tree Search (MCTS) method in continuous domains. Unlike previous approaches, our method assures safety anytime with respect to the currently expanded search tree without relying on the convergence of the search. We prove convergence in probability with an exponential rate of a version of our algorithms and study proposed techniques via extensive simulations. Even with a tiny number of tree queries, the best action found by our approach is much safer than the baseline. Moreover, our approach constantly finds better than the baseline action in terms of objective. This is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Energy Efficient Wireless Sensor Networks · Cryptography and Data Security
