Generate rather than Retrieve: Large Language Models are Strong Context   Generators

Wenhao Yu; Dan Iter; Shuohang Wang; Yichong Xu; Mingxuan Ju; Soumya; Sanyal; Chenguang Zhu; Michael Zeng; Meng Jiang

arXiv:2209.10063·cs.CL·January 26, 2023·86 cites

Generate rather than Retrieve: Large Language Models are Strong Context Generators

Wenhao Yu, Dan Iter, Shuohang Wang, Yichong Xu, Mingxuan Ju, Soumya, Sanyal, Chenguang Zhu, Michael Zeng, Meng Jiang

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces GenRead, a novel approach that replaces document retrieval with large language model generation for knowledge-intensive tasks, achieving superior results without external document retrieval.

Contribution

The paper proposes a generate-then-read framework and a clustering-based prompting method, significantly improving performance on knowledge tasks over traditional retrieve-then-read pipelines.

Findings

01

GenRead achieves 71.6 and 54.4 exact match scores on TriviaQA and WebQ.

02

GenRead outperforms the state-of-the-art retrieve-then-read pipeline DPR-FiD.

03

Combining retrieval and generation further enhances model performance.

Abstract

Knowledge-intensive tasks, such as open-domain question answering (QA), require access to a large amount of world or domain knowledge. A common approach for knowledge-intensive tasks is to employ a retrieve-then-read pipeline that first retrieves a handful of relevant contextual documents from an external corpus such as Wikipedia and then predicts an answer conditioned on the retrieved documents. In this paper, we present a novel perspective for solving knowledge-intensive tasks by replacing document retrievers with large language model generators. We call our method generate-then-read (GenRead), which first prompts a large language model to generate contextutal documents based on a given question, and then reads the generated documents to produce the final answer. Furthermore, we propose a novel clustering-based prompting method that selects distinct prompts, resulting in the generated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Generate rather than Retrieve: Large Language Models are Strong Context Generators· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications