Language Models as Knowledge Bases?
Fabio Petroni, Tim Rockt\"aschel, Patrick Lewis, Anton Bakhtin,, Yuxiang Wu, Alexander H. Miller, Sebastian Riedel

TL;DR
Pretrained language models inherently store relational and factual knowledge, enabling them to perform well on knowledge retrieval and open-domain question answering without additional fine-tuning.
Contribution
This paper provides an in-depth analysis showing that state-of-the-art language models contain substantial relational knowledge comparable to traditional methods, highlighting their potential as unsupervised QA systems.
Findings
BERT contains relational knowledge competitive with traditional NLP methods.
BERT performs well on open-domain question answering without fine-tuning.
Certain types of factual knowledge are learned more readily than others.
Abstract
Recent progress in pretraining language models on large textual corpora led to a surge of improvements for downstream NLP tasks. Whilst learning linguistic knowledge, these models may also be storing relational knowledge present in the training data, and may be able to answer queries structured as "fill-in-the-blank" cloze statements. Language models have many advantages over structured knowledge bases: they require no schema engineering, allow practitioners to query about an open class of relations, are easy to extend to more data, and require no human supervision to train. We present an in-depth analysis of the relational knowledge already present (without fine-tuning) in a wide range of state-of-the-art pretrained language models. We find that (i) without fine-tuning, BERT contains relational knowledge competitive with traditional NLP methods that have some access to oracle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
MethodsLinear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece · Softmax
