A Bayesian Approach to Online Planning
Nir Greshler, David Ben Eli, Carmel Rabinovitz, Gabi Guetta, Liran, Gispan, Guy Zohar, Aviv Tamar

TL;DR
This paper introduces a Bayesian online planning method that uses uncertainty estimates from neural networks to improve decision-making in complex environments, providing theoretical guarantees and empirical validation.
Contribution
It develops a Bayesian planning algorithm with finite-time regret bounds and demonstrates its effectiveness over existing methods in challenging environments.
Findings
Bayesian approach improves search efficiency when uncertainty estimates are accurate.
Proposed algorithms outperform traditional methods in ProcGen Maze and Leaper environments.
Uncertainty estimation quality significantly impacts planning performance.
Abstract
The combination of Monte Carlo tree search and neural networks has revolutionized online planning. As neural network approximations are often imperfect, we ask whether uncertainty estimates about the network outputs could be used to improve planning. We develop a Bayesian planning approach that facilitates such uncertainty quantification, inspired by classical ideas from the meta-reasoning literature. We propose a Thompson sampling based algorithm for searching the tree of possible actions, for which we prove the first (to our knowledge) finite time Bayesian regret bound, and propose an efficient implementation for a restricted family of posterior distributions. In addition we propose a variant of the Bayes-UCB method applied to trees. Empirically, we demonstrate that on the ProcGen Maze and Leaper environments, when the uncertainty estimates are accurate but the neural network output…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWikis in Education and Collaboration · Teaching and Learning Programming · Educational Games and Gamification
