Language-Agnostic Bias Detection in Language Models with Bias Probing
Abdullatif K\"oksal, Omer Faruk Yalcin, Ahmet Akbiyik, M. Tahir, Kilavuz, Anna Korhonen, Hinrich Sch\"utze

TL;DR
This paper introduces LABDet, a robust, language-agnostic bias probing method for pretrained language models, revealing consistent nationality biases across languages and linking these biases to pretraining data.
Contribution
The paper presents LABDet, a novel bias detection technique that is robust, language-agnostic, and capable of directly linking biases to pretraining data in language models.
Findings
Nationality bias patterns align with historical context across six languages.
Bias in English BERT correlates with pretraining data biases.
LABDet is reliable across different templates and languages.
Abstract
Pretrained language models (PLMs) are key components in NLP, but they contain strong social biases. Quantifying these biases is challenging because current methods focusing on fill-the-mask objectives are sensitive to slight changes in input. To address this, we propose a bias probing technique called LABDet, for evaluating social bias in PLMs with a robust and language-agnostic method. For nationality as a case study, we show that LABDet `surfaces' nationality bias by training a classifier on top of a frozen PLM on non-nationality sentiment detection. We find consistent patterns of nationality bias across monolingual PLMs in six languages that align with historical and political context. We also show for English BERT that bias surfaced by LABDet correlates well with bias in the pretraining data; thus, our work is one of the few studies that directly links pretraining data to PLM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Softmax · Layer Normalization · Dropout · Linear Layer · Attention Dropout · Adam · Dense Connections
