Bipol: Multi-axes Evaluation of Bias with Explainability in Benchmark Datasets
Tosin Adewumi, Isabella S\"odergren, Lama Alkhaled, Sana Sabah Sabry,, Foteini Liwicki, Marcus Liwicki

TL;DR
This paper introduces bipol, a novel multi-axes bias metric with explainability, applied to English and Swedish NLP datasets, along with a large Swedish bias-labelled dataset and new lexica, to objectively evaluate and explain bias in benchmark datasets.
Contribution
The paper presents bipol, a new multi-axes bias evaluation method with explainability, and provides a large Swedish bias dataset and lexica, advancing multilingual bias analysis in NLP.
Findings
Bipol effectively estimates bias across multiple axes.
The Swedish dataset contains 2 million bias-labelled samples.
Multilingual bias evaluation is feasible with the new resources.
Abstract
We investigate five English NLP benchmark datasets (on the superGLUE leaderboard) and two Swedish datasets for bias, along multiple axes. The datasets are the following: Boolean Question (Boolq), CommitmentBank (CB), Winograd Schema Challenge (WSC), Wino-gender diagnostic (AXg), Recognising Textual Entailment (RTE), Swedish CB, and SWEDN. Bias can be harmful and it is known to be common in data, which ML models learn from. In order to mitigate bias in data, it is crucial to be able to estimate it objectively. We use bipol, a novel multi-axes bias metric with explainability, to estimate and explain how much bias exists in these datasets. Multilingual, multi-axes bias evaluation is not very common. Hence, we also contribute a new, large Swedish bias-labelled dataset (of 2 million samples), translated from the English version and train the SotA mT5 model on it. In addition, we contribute…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Computational and Text Analysis Methods
MethodsAttention Is All You Need · Byte Pair Encoding · Linear Layer · SentencePiece · Multi-Head Attention · Residual Connection · Softmax · Dense Connections · Dropout · Layer Normalization
