COMPS: Conceptual Minimal Pair Sentences for testing Robust Property Knowledge and its Inheritance in Pre-trained Language Models

Kanishka Misra; Julia Taylor Rayz; Allyson Ettinger

arXiv:2210.01963 · cs.CL · February 10, 2023

Open Access · 1 Repo · 1 Dataset

TL;DR

This paper introduces COMPS, a dataset of minimal pair sentences designed to evaluate pre-trained language models' abilities to attribute properties to concepts and demonstrate property inheritance, revealing strengths and limitations in their reasoning capabilities.

Contribution

The paper presents COMPS, a novel benchmark for testing property attribution and inheritance in PLMs, and analyzes 22 models to assess their reasoning robustness.

Findings

01  PLMs easily distinguish trivial property differences.

02  Models struggle with nuanced concept-property relations.

03  Performance drops significantly with distracting information.

Abstract

A characteristic feature of human semantic cognition is its ability to not only store and retrieve the properties of concepts observed through experience, but to also facilitate the inheritance of properties (can breathe) from superordinate concepts (animal) to their subordinates (dog) -- i.e. demonstrate property inheritance. In this paper, we present COMPS, a collection of minimal pair sentences that jointly tests pre-trained language models (PLMs) on their ability to attribute properties to concepts and their ability to demonstrate property inheritance behavior. Analyses of 22 different PLMs on COMPS reveal that they can easily distinguish between concepts on the basis of a property when they are trivially different, but find it relatively difficult when concepts are related on the basis of nuanced knowledge representations. Furthermore, we find that PLMs can demonstrate behavior…
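To make the minimal-pair setup concrete, here is a small sketch of how accuracy on such pairs is commonly computed: a model "passes" a pair when it scores the acceptable sentence higher than the unacceptable one. This is an illustration under stated assumptions, not the authors' evaluation code. The function name `minimal_pair_accuracy`, the example sentences, and the numeric scores are all hypothetical; in practice `score` would be a sentence log-probability obtained from a pre-trained language model.

```python
from typing import Callable, Iterable, Tuple

# A minimal pair: (acceptable sentence, unacceptable sentence).
MinimalPair = Tuple[str, str]


def minimal_pair_accuracy(
    pairs: Iterable[MinimalPair],
    score: Callable[[str], float],
) -> float:
    """Fraction of pairs where the acceptable sentence gets the higher score.

    `score` is assumed to return something like a log-probability
    (higher means the model finds the sentence more plausible).
    """
    pairs = list(pairs)
    if not pairs:
        raise ValueError("need at least one minimal pair")
    correct = sum(1 for good, bad in pairs if score(good) > score(bad))
    return correct / len(pairs)


# Invented scores standing in for a real model; these are NOT results
# from the paper, only placeholders so the sketch runs on its own.
TOY_SCORES = {
    "A dog can breathe.": -4.0,
    "A rock can breathe.": -9.0,
    "A robin can fly.": -5.0,
    "A penguin can fly.": -4.5,
}


def toy_score(sentence: str) -> float:
    """Look up the placeholder score for a sentence."""
    return TOY_SCORES[sentence]


pairs = [
    ("A dog can breathe.", "A rock can breathe."),
    ("A robin can fly.", "A penguin can fly."),
]

accuracy = minimal_pair_accuracy(pairs, toy_score)
print(accuracy)
```

With these placeholder scores, the first pair is judged correctly and the second is not, which loosely mirrors the contrast the abstract draws between trivially different concepts and closely related ones.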

Code & Models

Repositories

kanishkamisra/comps
PyTorch · Official

Datasets

kanishka/comps
dataset · 13 downloads

Taxonomy

Topics: Topic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques