PAC-MCTS: Bias-Aware Pruning for Robust LLM-Guided Search and Planning
Tianhao Qian

TL;DR
PAC-MCTS introduces a bias-aware pruning method for search and planning with large language models, providing formal safety guarantees and improving robustness and efficiency in complex environments.
Contribution
It formulates node expansion as a biased Best-Arm Identification problem, deriving sample complexity bounds and proposing a dynamic, bias-aware pruning framework for LLM-guided search.
Findings
PAC-MCTS reduces API evaluations by up to 78%.
It achieves over 3x higher sample efficiency under strict compute budgets.
Experiments validate robustness improvements with increasing bias.
Abstract
As search depth increases in autonomous reasoning and embodied planning, candidate action spaces expand exponentially, often exhausting computational budgets. While heuristic pruning is a critical countermeasure, existing approaches lack formal safety guarantees when guided by surrogate evaluators such as Large Language Models (LLMs), which exhibit systematic biases. We formulate node expansion as a localized Best-Arm Identification (BAI) problem under bounded bias and derive a sample complexity upper bound of , identifying as the regime where safe elimination is feasible. We further establish an information-theoretic lower bound of that characterizes the structural limits of biased exploration. Motivated by these results, we propose PAC-MCTS, a bias-aware pruning framework that dynamically adapts confidence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
