CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked   Language Models

Nikita Nangia; Clara Vania; Rasika Bhalerao; Samuel R. Bowman

arXiv:2010.00133·cs.CL·October 2, 2020

CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models

Nikita Nangia, Clara Vania, Rasika Bhalerao, Samuel R. Bowman

PDF

2 Repos 7 Models 5 Datasets

TL;DR

CrowS-Pairs is a benchmark dataset designed to measure social biases in masked language models by comparing stereotypical and less stereotypical sentences across multiple bias categories, revealing prevalent biases in current models.

Contribution

Introduces CrowS-Pairs, a new dataset with 1508 examples to evaluate social biases in masked language models across nine bias types, focusing on disadvantaged groups.

Findings

01

All evaluated MLMs favor stereotypical sentences across categories.

02

CrowS-Pairs can serve as a benchmark for bias evaluation.

03

The dataset highlights the pervasiveness of social biases in language models.

Abstract

Pretrained language models, especially masked language models (MLMs) have seen success across many NLP tasks. However, there is ample evidence that they use the cultural biases that are undoubtedly present in the corpora they are trained on, implicitly creating harm with biased representations. To measure some forms of social bias in language models against protected demographic groups in the US, we introduce the Crowdsourced Stereotype Pairs benchmark (CrowS-Pairs). CrowS-Pairs has 1508 examples that cover stereotypes dealing with nine types of bias, like race, religion, and age. In CrowS-Pairs a model is presented with two sentences: one that is more stereotyping and another that is less stereotyping. The data focuses on stereotypes about historically disadvantaged groups and contrasts them with advantaged groups. We find that all three of the widely-used MLMs we evaluate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.