SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval
Xin Xie, Dongyun Xue, Wuguannan Yao, Mingxiao Feng, Wengang Zhou, Xiang Qi, Houqiang Li, Peng Zhang

TL;DR
SGA-MCTS introduces a retrieval-based planning framework for LLMs that enables real-time, scalable reasoning by distilling high-quality trajectories into reusable primitives, matching state-of-the-art performance without fine-tuning.
Contribution
It presents a novel non-parametric retrieval approach using MCTS to create reusable symbolic primitives, enabling fast and effective LLM planning without task-specific training.
Findings
Matches GPT-5 performance on complex benchmarks
Enables real-time autonomous planning with frozen models
Amortizes search costs for scalable reasoning
Abstract
LLM-powered systems require complex multi-step decision-making abilities to solve real-world tasks, yet current planning approaches face a trade-off between the high latency of inference-time search and the limited generalization of supervised fine-tuning. To address this limitation, we introduce \textbf{SGA-MCTS}, a framework that casts LLM planning as non-parametric retrieval. Offline, we leverage Monte Carlo Tree Search (MCTS) to explore the solution space and distill high-fidelity trajectories into State-Goal-Action (SGA) atoms. These atoms are de-lexicalized primitives that abstract concrete entities into symbolic slots, preserving reusable causal logic while discarding domain-specific noise. Online, a retrieval-augmented agent employs a hybrid symbolic-semantic mechanism to fetch relevant SGAs and re-ground them into the current context as soft reasoning hints. Empirical results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
