Low Frequency Names Exhibit Bias and Overfitting in Contextualizing   Language Models

Robert Wolfe; Aylin Caliskan

arXiv:2110.00672·cs.CY·October 5, 2021

Low Frequency Names Exhibit Bias and Overfitting in Contextualizing Language Models

Robert Wolfe, Aylin Caliskan

PDF

Open Access

TL;DR

This paper investigates how low-frequency names in training data cause bias and overfitting in language models like BERT and GPT-2, affecting their representation and bias towards minority groups.

Contribution

It reveals the relationship between name frequency, bias, and overfitting in multiple language models, highlighting the impact on minority and female names.

Findings

01

Infrequent names are less similar to initial representations.

02

Lower-frequency minority names are more associated with unpleasantness.

03

Models rely on less context-informed representations for rare names.

Abstract

We use a dataset of U.S. first names with labels based on predominant gender and racial group to examine the effect of training corpus frequency on tokenization, contextualization, similarity to initial representation, and bias in BERT, GPT-2, T5, and XLNet. We show that predominantly female and non-white names are less frequent in the training corpora of these four language models. We find that infrequent names are more self-similar across contexts, with Spearman's r between frequency and self-similarity as low as -.763. Infrequent names are also less similar to initial representation, with Spearman's r between frequency and linear centered kernel alignment (CKA) similarity to initial representation as high as .702. Moreover, we find Spearman's r between racial bias and name frequency in BERT of .492, indicating that lower-frequency minority group names are more associated with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Authorship Attribution and Profiling

MethodsGated Linear Unit · Attention Is All You Need · Linear Layer · Cosine Annealing · WordPiece · Linear Warmup With Cosine Annealing · Adam · Attention Dropout · Residual Connection · Byte Pair Encoding