Undesirable Biases in NLP: Addressing Challenges of Measurement
Oskar van der Wal, Dominik Bachmann, Alina Leidinger, Leendert van, Maanen, Willem Zuidema, Katrin Schulz

TL;DR
This paper highlights the challenges in measuring biases in NLP models and proposes using psychometric principles like construct validity and reliability to improve bias assessment methods.
Contribution
It introduces an interdisciplinary approach applying psychometric concepts to enhance the measurement of biases in NLP models.
Findings
Psychometric concepts can improve bias measurement in NLP.
Current bias measures often lack validity and reliability.
Interdisciplinary tools can lead to better bias mitigation strategies.
Abstract
As Large Language Models and Natural Language Processing (NLP) technology rapidly develop and spread into daily life, it becomes crucial to anticipate how their use could harm people. One problem that has received a lot of attention in recent years is that this technology has displayed harmful biases, from generating derogatory stereotypes to producing disparate outcomes for different social groups. Although a lot of effort has been invested in assessing and mitigating these biases, our methods of measuring the biases of NLP models have serious problems and it is often unclear what they actually measure. In this paper, we provide an interdisciplinary approach to discussing the issue of NLP model bias by adopting the lens of psychometrics -- a field specialized in the measurement of concepts like bias that are not directly observable. In particular, we will explore two central notions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
