LLM Tree Search

Dylan Wilson

arXiv:2410.19117·cs.CL·October 28, 2024

LLM Tree Search

Dylan Wilson

PDF

Open Access

TL;DR

This paper introduces a novel search paradigm for large language models inspired by AlphaGo, aiming to improve sequence generation by exploring diverse paths and evaluating them based on model confidence.

Contribution

It proposes a new search-based method for LLMs that enhances output quality, diversity, and problem-solving capabilities by adapting tree search techniques.

Findings

01

Potential to increase output quality and diversity

02

Reduces errors and compound error problems

03

Enables iterative problem-solving and self-training

Abstract

This project aims to investigate a novel sequence generation method inspired by the AlphaGo paradigm, adapting it for use with large language models (LLMs). The proposed approach involves creating search trees of different possible completions and evaluating these completions based on model confidence. By considering various paths in the search tree and scoring them according to the model's confidence in each completion, we can generate diverse and high-quality sequences. This research explores the implementation of this paradigm by using confidence as a proxy for response quality akin to beam search \citep{vijayakumar2016diverse}. The primary goal of this paper is to outline the paradigm and demonstrate its potential, rather than focusing on achieving perfect results. The paper will outline the reasons why we believe this paradigm has the potential to improve LLMs in the following…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Mathematics, Computing, and Information Processing · Semantic Web and Ontologies

MethodsSparse Evolutionary Training