Measuring and Improving Consistency in Pretrained Language Models

Yanai Elazar; Nora Kassner; Shauli Ravfogel; Abhilasha Ravichander,; Eduard Hovy; Hinrich Sch\"utze; Yoav Goldberg

arXiv:2102.01017·cs.CL·June 1, 2021·32 cites

Measuring and Improving Consistency in Pretrained Language Models

Yanai Elazar, Nora Kassner, Shauli Ravfogel, Abhilasha Ravichander,, Eduard Hovy, Hinrich Sch\"utze, Yoav Goldberg

PDF

Open Access 1 Repo

TL;DR

This paper evaluates the consistency of pretrained language models with respect to factual knowledge, introduces a new paraphrase dataset, and proposes a method to enhance model consistency.

Contribution

It introduces ParaRel, a high-quality paraphrase dataset for evaluating consistency, and presents a novel method to improve the consistency of pretrained language models.

Findings

01

PLMs show poor consistency with high variance across relations.

02

Representational spaces of PLMs are poorly structured for robust knowledge representation.

03

The proposed method effectively improves model consistency.

Abstract

Consistency of a model -- that is, the invariance of its behavior under meaning-preserving alternations in its input -- is a highly desirable property in natural language processing. In this paper we study the question: Are Pretrained Language Models (PLMs) consistent with respect to factual knowledge? To this end, we create ParaRel, a high-quality resource of cloze-style query English paraphrases. It contains a total of 328 paraphrases for 38 relations. Using ParaRel, we show that the consistency of all PLMs we experiment with is poor -- though with high variance between relations. Our analysis of the representational spaces of PLMs suggests that they have a poor structure and are currently not suitable for representing knowledge robustly. Finally, we propose a method for improving model consistency and experimentally demonstrate its effectiveness.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yanaiela/pararel
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications