Language-Agnostic Bias Detection in Language Models with Bias Probing

Abdullatif K\"oksal; Omer Faruk Yalcin; Ahmet Akbiyik; M. Tahir; Kilavuz; Anna Korhonen; Hinrich Sch\"utze

arXiv:2305.13302·cs.CL·November 21, 2023·1 cites

Language-Agnostic Bias Detection in Language Models with Bias Probing

Abdullatif K\"oksal, Omer Faruk Yalcin, Ahmet Akbiyik, M. Tahir, Kilavuz, Anna Korhonen, Hinrich Sch\"utze

PDF

Open Access 1 Repo

TL;DR

This paper introduces LABDet, a robust, language-agnostic bias probing method for pretrained language models, revealing consistent nationality biases across languages and linking these biases to pretraining data.

Contribution

The paper presents LABDet, a novel bias detection technique that is robust, language-agnostic, and capable of directly linking biases to pretraining data in language models.

Findings

01

Nationality bias patterns align with historical context across six languages.

02

Bias in English BERT correlates with pretraining data biases.

03

LABDet is reliable across different templates and languages.

Abstract

Pretrained language models (PLMs) are key components in NLP, but they contain strong social biases. Quantifying these biases is challenging because current methods focusing on fill-the-mask objectives are sensitive to slight changes in input. To address this, we propose a bias probing technique called LABDet, for evaluating social bias in PLMs with a robust and language-agnostic method. For nationality as a case study, we show that LABDet `surfaces' nationality bias by training a classifier on top of a frozen PLM on non-nationality sentiment detection. We find consistent patterns of nationality bias across monolingual PLMs in six languages that align with historical and political context. We also show for English BERT that bias surfaced by LABDet correlates well with bias in the pretraining data; thus, our work is one of the few studies that directly links pretraining data to PLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

akoksal/labdet
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Softmax · Layer Normalization · Dropout · Linear Layer · Attention Dropout · Adam · Dense Connections