Policy Guided Tree Search for Enhanced LLM Reasoning

Yang Li

arXiv:2502.06813·cs.LG·February 12, 2025

Policy Guided Tree Search for Enhanced LLM Reasoning

Yang Li

PDF

Open Access

TL;DR

This paper introduces Policy-Guided Tree Search (PGTS), a reinforcement learning framework that improves large language model reasoning by dynamically guiding structured exploration, outperforming existing heuristic-based methods in efficiency and accuracy.

Contribution

The paper presents a novel reinforcement learning approach that learns policies for structured tree exploration, reducing reliance on heuristics and enhancing reasoning capabilities of LLMs.

Findings

01

PGTS outperforms existing methods on reasoning benchmarks.

02

It significantly reduces computational costs.

03

Demonstrates scalability across diverse reasoning tasks.

Abstract

Despite their remarkable capabilities, large language models often struggle with tasks requiring complex reasoning and planning. While existing approaches like Chain-of-Thought prompting and tree search techniques show promise, they are limited by their reliance on predefined heuristics and computationally expensive exploration strategies. We propose Policy-Guided Tree Search (PGTS), a framework that combines reinforcement learning with structured tree exploration to efficiently navigate reasoning paths. Our key innovation is a learned policy that dynamically decides between expanding, branching, backtracking, or terminating exploration, eliminating the need for manual heuristics or exhaustive search. Experiments across mathematical reasoning, logical deduction, and planning benchmarks demonstrate that PGTS achieves superior reasoning performance while significantly reducing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Rights Management and Security · Semantic Web and Ontologies