How Can We Know What Language Models Know?

Zhengbao Jiang; Frank F. Xu; Jun Araki; Graham Neubig

arXiv:1911.12543·cs.CL·May 5, 2020

How Can We Know What Language Models Know?

Zhengbao Jiang, Frank F. Xu, Jun Araki, Graham Neubig

PDF

1 Repo

TL;DR

This paper introduces methods to automatically generate and combine better prompts for querying language models, significantly improving the accuracy of knowledge extraction from LMs.

Contribution

It proposes mining and paraphrasing techniques to discover high-quality prompts, enhancing the estimation of what language models truly know.

Findings

01

Improved accuracy from 31.1% to 39.6% on the LAMA benchmark.

02

Demonstrated that prompt quality greatly affects knowledge retrieval.

03

Provided open-source tools for better prompt generation and querying.

Abstract

Recent work has presented intriguing results examining the knowledge contained in language models (LM) by having the LM fill in the blanks of prompts such as "Obama is a _ by profession". These prompts are usually manually created, and quite possibly sub-optimal; another prompt such as "Obama worked as a _" may result in more accurately predicting the correct profession. Because of this, given an inappropriate prompt, we might fail to retrieve facts that the LM does know, and thus any given prompt only provides a lower bound estimate of the knowledge contained in an LM. In this paper, we attempt to more accurately estimate the knowledge contained in LMs by automatically discovering better prompts to use in this querying process. Specifically, we propose mining-based and paraphrasing-based methods to automatically generate high-quality and diverse prompts, as well as ensemble methods to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jzbjyb/LPAQA
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.