Generics are puzzling. Can language models find the missing piece?

Gustavo Cilleruelo Calder\'on; Emily Allaway; Barry Haddow; Alexandra; Birch

arXiv:2412.11318·cs.CL·December 17, 2024

Generics are puzzling. Can language models find the missing piece?

Gustavo Cilleruelo Calder\'on, Emily Allaway, Barry Haddow, Alexandra, Birch

PDF

Open Access 1 Repo

TL;DR

This paper investigates how language models understand generics, revealing their context sensitivity, the expression of weak generalizations, and the reflection of stereotypes, through a new dataset and surprisal-based metric.

Contribution

The study introduces ConGen, a dataset of generic sentences, and a surprisal-based metric to analyze generics' context sensitivity and implicit quantification in language models.

Findings

01

Generics are more context-sensitive than determiner quantifiers.

02

Approximately 20% of generics express weak generalizations.

03

Language models reflect human stereotypes in their responses.

Abstract

Generic sentences express generalisations about the world without explicit quantification. Although generics are central to everyday communication, building a precise semantic framework has proven difficult, in part because speakers use generics to generalise properties with widely different statistical prevalence. In this work, we study the implicit quantification and context-sensitivity of generics by leveraging language models as models of language. We create ConGen, a dataset of 2873 naturally occurring generic and quantified sentences in context, and define p-acceptability, a metric based on surprisal that is sensitive to quantification. Our experiments show generics are more context-sensitive than determiner quantifiers and about 20% of naturally occurring generics we analyze express weak generalisations. We also explore how human biases in stereotypes can be observed in language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ilyocoris/generics_are_puzzling
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques