SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval

Xin Xie; Dongyun Xue; Wuguannan Yao; Mingxiao Feng; Wengang Zhou; Xiang Qi; Houqiang Li; Peng Zhang

arXiv:2604.14712·cs.AI·April 17, 2026

SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval

Xin Xie, Dongyun Xue, Wuguannan Yao, Mingxiao Feng, Wengang Zhou, Xiang Qi, Houqiang Li, Peng Zhang

PDF

TL;DR

SGA-MCTS introduces a retrieval-based planning framework for LLMs that enables real-time, scalable reasoning by distilling high-quality trajectories into reusable primitives, matching state-of-the-art performance without fine-tuning.

Contribution

It presents a novel non-parametric retrieval approach using MCTS to create reusable symbolic primitives, enabling fast and effective LLM planning without task-specific training.

Findings

01

Matches GPT-5 performance on complex benchmarks

02

Enables real-time autonomous planning with frozen models

03

Amortizes search costs for scalable reasoning

Abstract

LLM-powered systems require complex multi-step decision-making abilities to solve real-world tasks, yet current planning approaches face a trade-off between the high latency of inference-time search and the limited generalization of supervised fine-tuning. To address this limitation, we introduce \textbf{SGA-MCTS}, a framework that casts LLM planning as non-parametric retrieval. Offline, we leverage Monte Carlo Tree Search (MCTS) to explore the solution space and distill high-fidelity trajectories into State-Goal-Action (SGA) atoms. These atoms are de-lexicalized primitives that abstract concrete entities into symbolic slots, preserving reusable causal logic while discarding domain-specific noise. Online, a retrieval-augmented agent employs a hybrid symbolic-semantic mechanism to fetch relevant SGAs and re-ground them into the current context as soft reasoning hints. Empirical results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.