PAC-MCTS: Bias-Aware Pruning for Robust LLM-Guided Search and Planning

Tianhao Qian

arXiv:2604.14345·cs.LG·May 12, 2026

PAC-MCTS: Bias-Aware Pruning for Robust LLM-Guided Search and Planning

Tianhao Qian

PDF

TL;DR

PAC-MCTS introduces a bias-aware pruning method for search and planning with large language models, providing formal safety guarantees and improving robustness and efficiency in complex environments.

Contribution

It formulates node expansion as a biased Best-Arm Identification problem, deriving sample complexity bounds and proposing a dynamic, bias-aware pruning framework for LLM-guided search.

Findings

01

PAC-MCTS reduces API evaluations by up to 78%.

02

It achieves over 3x higher sample efficiency under strict compute budgets.

03

Experiments validate robustness improvements with increasing bias.

Abstract

As search depth increases in autonomous reasoning and embodied planning, candidate action spaces expand exponentially, often exhausting computational budgets. While heuristic pruning is a critical countermeasure, existing approaches lack formal safety guarantees when guided by surrogate evaluators such as Large Language Models (LLMs), which exhibit systematic biases. We formulate node expansion as a localized Best-Arm Identification (BAI) problem under bounded bias $L$ and derive a sample complexity upper bound of $O ((Δ - 4 L)^{- 2})$ , identifying $Δ > 4 L$ as the regime where safe elimination is feasible. We further establish an information-theoretic lower bound of $Ω ((Δ - 2 L)^{- 2})$ that characterizes the structural limits of biased exploration. Motivated by these results, we propose PAC-MCTS, a bias-aware pruning framework that dynamically adapts confidence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.