CorpusBrain: Pre-train a Generative Retrieval Model for   Knowledge-Intensive Language Tasks

Jiangui Chen; Ruqing Zhang; Jiafeng Guo; Yiqun Liu; Yixing Fan; Xueqi; Cheng

arXiv:2208.07652·cs.CL·August 17, 2022

CorpusBrain: Pre-train a Generative Retrieval Model for Knowledge-Intensive Language Tasks

Jiangui Chen, Ruqing Zhang, Jiafeng Guo, Yiqun Liu, Yixing Fan, Xueqi, Cheng

PDF

1 Repo

TL;DR

CorpusBrain introduces a pre-trained generative retrieval model that simplifies knowledge-intensive task retrieval, replacing traditional pipelines with end-to-end training, leading to state-of-the-art results on KILT benchmarks.

Contribution

The paper presents a novel single-step generative retrieval model, CorpusBrain, pre-trained with specialized tasks, enabling end-to-end optimization and improved performance on knowledge-intensive tasks.

Findings

01

Outperforms strong baselines on KILT benchmark

02

Effective in zero- and low-resource settings

03

Encodes entire corpus information in model parameters

Abstract

Knowledge-intensive language tasks (KILT) usually require a large body of information to provide correct answers. A popular paradigm to solve this problem is to combine a search system with a machine reader, where the former retrieves supporting evidences and the latter examines them to produce answers. Recently, the reader component has witnessed significant advances with the help of large-scale pre-trained generative models. Meanwhile most existing solutions in the search component rely on the traditional ``index-retrieve-then-rank'' pipeline, which suffers from large memory footprint and difficulty in end-to-end optimization. Inspired by recent efforts in constructing model-based IR models, we propose to replace the traditional multi-step search pipeline with a novel single-step generative model, which can dramatically simplify the search process and be optimized in an end-to-end…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ict-bigdatalab/corpusbrain
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.