Bipol: Multi-axes Evaluation of Bias with Explainability in Benchmark   Datasets

Tosin Adewumi; Isabella S\"odergren; Lama Alkhaled; Sana Sabah Sabry,; Foteini Liwicki; Marcus Liwicki

arXiv:2301.12139·cs.CL·September 19, 2023

Bipol: Multi-axes Evaluation of Bias with Explainability in Benchmark Datasets

Tosin Adewumi, Isabella S\"odergren, Lama Alkhaled, Sana Sabah Sabry,, Foteini Liwicki, Marcus Liwicki

PDF

Open Access 2 Repos

TL;DR

This paper introduces bipol, a novel multi-axes bias metric with explainability, applied to English and Swedish NLP datasets, along with a large Swedish bias-labelled dataset and new lexica, to objectively evaluate and explain bias in benchmark datasets.

Contribution

The paper presents bipol, a new multi-axes bias evaluation method with explainability, and provides a large Swedish bias dataset and lexica, advancing multilingual bias analysis in NLP.

Findings

01

Bipol effectively estimates bias across multiple axes.

02

The Swedish dataset contains 2 million bias-labelled samples.

03

Multilingual bias evaluation is feasible with the new resources.

Abstract

We investigate five English NLP benchmark datasets (on the superGLUE leaderboard) and two Swedish datasets for bias, along multiple axes. The datasets are the following: Boolean Question (Boolq), CommitmentBank (CB), Winograd Schema Challenge (WSC), Wino-gender diagnostic (AXg), Recognising Textual Entailment (RTE), Swedish CB, and SWEDN. Bias can be harmful and it is known to be common in data, which ML models learn from. In order to mitigate bias in data, it is crucial to be able to estimate it objectively. We use bipol, a novel multi-axes bias metric with explainability, to estimate and explain how much bias exists in these datasets. Multilingual, multi-axes bias evaluation is not very common. Hence, we also contribute a new, large Swedish bias-labelled dataset (of 2 million samples), translated from the English version and train the SotA mT5 model on it. In addition, we contribute…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Computational and Text Analysis Methods

MethodsAttention Is All You Need · Byte Pair Encoding · Linear Layer · SentencePiece · Multi-Head Attention · Residual Connection · Softmax · Dense Connections · Dropout · Layer Normalization