REALM: Retrieval-Augmented Language Model Pre-Training
Kelvin Guu, Kenton Lee, Zora Tung, Panupong Pasupat, Ming-Wei Chang

TL;DR
REALM introduces a retrieval-augmented pre-training method for language models that enhances knowledge access, interpretability, and modularity, significantly improving open-domain question answering performance.
Contribution
It presents a novel unsupervised pre-training approach for a knowledge retriever integrated with language models, enabling effective retrieval of documents during training and inference.
Findings
Outperforms previous models on open-domain QA benchmarks by 4-16% accuracy
First to pre-train a knowledge retriever in an unsupervised manner using masked language modeling
Provides benefits in interpretability and modularity of language models
Abstract
Language model pre-training has been shown to capture a surprising amount of world knowledge, crucial for NLP tasks such as question answering. However, this knowledge is stored implicitly in the parameters of a neural network, requiring ever-larger networks to cover more facts. To capture knowledge in a more modular and interpretable way, we augment language model pre-training with a latent knowledge retriever, which allows the model to retrieve and attend over documents from a large corpus such as Wikipedia, used during pre-training, fine-tuning and inference. For the first time, we show how to pre-train such a knowledge retriever in an unsupervised manner, using masked language modeling as the learning signal and backpropagating through a retrieval step that considers millions of documents. We demonstrate the effectiveness of Retrieval-Augmented Language Model pre-training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗google/t5-11b-ssm-nqmodel· 34 dl34 dl
- 🤗google/t5-11b-ssm-nqomodel· 10 dl10 dl
- 🤗google/t5-11b-ssm-tqamodel· 16 dl· ♡ 1216 dl♡ 12
- 🤗google/t5-11b-ssm-tqaomodel· 19 dl19 dl
- 🤗google/t5-11b-ssm-wqmodel· 10 dl· ♡ 110 dl♡ 1
- 🤗google/t5-11b-ssm-wqomodel· ♡ 1♡ 1
- 🤗google/t5-11b-ssmmodel· 13 dl13 dl
- 🤗google/t5-3b-ssm-nqmodel· 31 dl31 dl
- 🤗google/t5-3b-ssm-nqomodel· 16 dl16 dl
- 🤗google/t5-3b-ssmmodel· 16 dl· ♡ 116 dl♡ 1
Videos
REALM: Retrieval-Augmented Language Model Pre-Training (Paper Explained)· youtube
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
MethodsInterpretability
