A Review on Language Models as Knowledge Bases

Badr AlKhamissi; Millicent Li; Asli Celikyilmaz; Mona Diab; Marjan; Ghazvininejad

arXiv:2204.06031·cs.CL·April 14, 2022·63 cites

A Review on Language Models as Knowledge Bases

Badr AlKhamissi, Millicent Li, Asli Celikyilmaz, Mona Diab, Marjan, Ghazvininejad

PDF

Open Access

TL;DR

This paper reviews how pretrained language models can serve as implicit knowledge bases, capturing information without human supervision, and discusses the necessary aspects for their effective use as KBs.

Contribution

It provides a comprehensive review of recent literature on using language models as knowledge bases and outlines key aspects for their effectiveness.

Findings

01

Language models encode significant knowledge in parameters

02

Probing LMs can reveal various types of knowledge

03

LM-based KBs require specific aspects for optimal performance

Abstract

Recently, there has been a surge of interest in the NLP community on the use of pretrained Language Models (LMs) as Knowledge Bases (KBs). Researchers have shown that LMs trained on a sufficiently large (web) corpus will encode a significant amount of knowledge implicitly in its parameters. The resulting LM can be probed for different kinds of knowledge and thus acting as a KB. This has a major advantage over traditional KBs in that this method requires no human supervision. In this paper, we present a set of aspects that we deem a LM should have to fully act as a KB, and review the recent literature with respect to those aspects.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management