Large Language Model Programs

Imanol Schlag; Sainbayar Sukhbaatar; Asli Celikyilmaz; Wen-tau Yih,; Jason Weston; J\"urgen Schmidhuber; Xian Li

arXiv:2305.05364·cs.LG·May 10, 2023·5 cites

Large Language Model Programs

Imanol Schlag, Sainbayar Sukhbaatar, Asli Celikyilmaz, Wen-tau Yih,, Jason Weston, J\"urgen Schmidhuber, Xian Li

PDF

Open Access

TL;DR

This paper explores embedding large language models within algorithms to enhance their capabilities, demonstrating a 6.4% improvement in evidence-supported question-answering without finetuning.

Contribution

It introduces a method to embed LLMs in algorithms, expanding their functionality beyond traditional in-context learning approaches.

Findings

01

6.4% improvement over chain of thought baseline

02

Enhanced question-answering performance without finetuning

03

Discussion of advantages and disadvantages of the approach

Abstract

In recent years, large pre-trained language models (LLMs) have demonstrated the ability to follow instructions and perform novel tasks from a few examples. The possibility to parameterise an LLM through such in-context examples widens their capability at a much lower cost than finetuning. We extend this line of reasoning and present a method which further expands the capabilities of an LLM by embedding it within an algorithm or program. To demonstrate the benefits of this approach, we present an illustrative example of evidence-supported question-answering. We obtain a 6.4\% improvement over the chain of thought baseline through a more algorithmic approach without any finetuning. Furthermore, we highlight recent work from this perspective and discuss the advantages and disadvantages in comparison to the standard approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems