
TL;DR
This paper introduces a novel search paradigm for large language models inspired by AlphaGo, aiming to improve sequence generation by exploring diverse paths and evaluating them based on model confidence.
Contribution
It proposes a new search-based method for LLMs that enhances output quality, diversity, and problem-solving capabilities by adapting tree search techniques.
Findings
Potential to increase output quality and diversity
Reduces errors and compound error problems
Enables iterative problem-solving and self-training
Abstract
This project aims to investigate a novel sequence generation method inspired by the AlphaGo paradigm, adapting it for use with large language models (LLMs). The proposed approach involves creating search trees of different possible completions and evaluating these completions based on model confidence. By considering various paths in the search tree and scoring them according to the model's confidence in each completion, we can generate diverse and high-quality sequences. This research explores the implementation of this paradigm by using confidence as a proxy for response quality akin to beam search \citep{vijayakumar2016diverse}. The primary goal of this paper is to outline the paradigm and demonstrate its potential, rather than focusing on achieving perfect results. The paper will outline the reasons why we believe this paradigm has the potential to improve LLMs in the following…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Mathematics, Computing, and Information Processing · Semantic Web and Ontologies
MethodsSparse Evolutionary Training
