Evaluating Gender Bias of Pre-trained Language Models in Natural Language Inference by Considering All Labels

Panatchakorn Anantaprayoon; Masahiro Kaneko; Naoaki Okazaki

arXiv:2309.09697 · cs.CL · May 21, 2024 · 1 citation

TL;DR

This paper introduces NLI-CoAL, a bias evaluation method for pre-trained language models in natural language inference. Unlike earlier measures that look at a single label, it considers all three NLI labels, and the authors apply it to datasets in several languages.

Contribution

It proposes a comprehensive bias measure considering all NLI labels, creates multilingual evaluation datasets, and validates the measure's effectiveness and cross-lingual applicability.

Findings

1. NLI-CoAL outperforms baseline bias measures in distinguishing biased inferences.
2. The method is effective across English, Japanese, and Chinese datasets.
3. This is the first work to evaluate PLM bias in Japanese and Chinese NLI tasks.

Abstract

Discriminatory gender biases have been found in Pre-trained Language Models (PLMs) for multiple languages. In Natural Language Inference (NLI), existing bias evaluation methods have focused on the prediction results of one specific label out of three labels, such as neutral. However, such evaluation methods can be inaccurate since unique biased inferences are associated with unique prediction labels. Addressing this limitation, we propose a bias evaluation method for PLMs, called NLI-CoAL, which considers all three labels of the NLI task. First, we create three evaluation data groups that represent different types of biases. Then, we define a bias measure based on the corresponding label output of each data group. In the experiments, we introduce a meta-evaluation technique for NLI bias measures and use it to confirm that our bias measure can distinguish biased, incorrect inferences…
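
The general shape of a measure "based on the corresponding label output of each data group" can be illustrated with a minimal Python sketch. This is not the paper's formula: the group names (`group_a`, `group_b`, `group_c`), the label assigned to each group, and the plain averaging are all hypothetical placeholders chosen for illustration. The paper and its repository define the actual data groups and measure.

```python
from typing import Dict, List


def label_rate(predictions: List[str], label: str) -> float:
    """Fraction of predictions equal to `label` (0.0 for an empty list)."""
    if not predictions:
        return 0.0
    return sum(p == label for p in predictions) / len(predictions)


def bias_score(
    group_predictions: Dict[str, List[str]],
    group_label: Dict[str, str],
) -> float:
    """Average, over data groups, of the rate of each group's associated label.

    ASSUMPTION: an unweighted mean is used here only as an example of
    combining per-group label rates; it is not taken from the paper.
    """
    rates = [label_rate(group_predictions[g], group_label[g]) for g in group_label]
    return sum(rates) / len(rates)


# Hypothetical mapping from data group to the label treated as indicating bias.
group_label = {
    "group_a": "entailment",
    "group_b": "contradiction",
    "group_c": "neutral",
}

# Toy model predictions per group (made up for illustration, not real results).
group_predictions = {
    "group_a": ["entailment", "entailment", "neutral", "neutral"],
    "group_b": ["contradiction", "neutral"],
    "group_c": ["neutral", "neutral", "entailment", "neutral"],
}

score = bias_score(group_predictions, group_label)
print(f"illustrative score: {score:.3f}")
```

The point of the sketch is only that each data group contributes the rate of its own label, so all three labels feed into the final number, rather than a single label such as neutral.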

Code & Models

Repositories

panatchakorn-a/bias-eval-nli-considering-all-labels (Official)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling