Analyzing Correlations Between Intrinsic and Extrinsic Bias Metrics of Static Word Embeddings With Their Measuring Biases Aligned

Taisei Katô; Yusuke Miyao

arXiv:2409.09260·cs.CL·September 17, 2024

TL;DR

This paper investigates how well intrinsic bias metrics of static word embeddings predict biased behaviors in NLP systems by analyzing correlations with extrinsic bias metrics, revealing variable predictive power across different settings.

Contribution

The paper introduces a method that aligns the biases measured by intrinsic and extrinsic bias metrics by extracting characteristic words, clarifying when intrinsic metrics predict bias effectively.

Findings

01. Intrinsic bias metrics correlate moderately to highly with some extrinsic bias metrics.

02. They show little to no correlation with other extrinsic bias metrics.

03. Intrinsic bias metrics can therefore predict biased behavior only in specific settings.

Abstract

We examine the abilities of intrinsic bias metrics of static word embeddings to predict whether Natural Language Processing (NLP) systems exhibit biased behavior. A word embedding is one of the fundamental NLP technologies that represents the meanings of words through real vectors, and problematically, it also learns social biases such as stereotypes. An intrinsic bias metric measures bias by examining a characteristic of vectors, while an extrinsic bias metric checks whether an NLP system trained with a word embedding is biased. A previous study found that a common intrinsic bias metric usually does not correlate with extrinsic bias metrics. However, the intrinsic and extrinsic bias metrics did not measure the same bias in most cases, which makes us question whether the lack of correlation is genuine. In this paper, we extract characteristic words from datasets of extrinsic bias…
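To make the distinction in the abstract concrete, the following is a minimal sketch, not the authors' method. It assumes a simplified cosine-association score as the intrinsic metric and Pearson correlation as the comparison statistic; the abstract names neither. The function names (`association`, `intrinsic_score`, `pearson`) and all numbers are hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def association(word, attrs_a, attrs_b):
    """Mean similarity to attribute set A minus mean similarity to set B."""
    mean_a = sum(cosine(word, a) for a in attrs_a) / len(attrs_a)
    mean_b = sum(cosine(word, b) for b in attrs_b) / len(attrs_b)
    return mean_a - mean_b


def intrinsic_score(targets, attrs_a, attrs_b):
    """Assumed intrinsic metric: average association over the target words."""
    return sum(association(t, attrs_a, attrs_b) for t in targets) / len(targets)


def pearson(xs, ys):
    """Pearson correlation between two equal-length score lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)


# Toy values only, not results from the paper: one intrinsic score per
# embedding, paired with an extrinsic score from a system trained on it.
intrinsic = [0.10, 0.25, 0.40]
extrinsic = [0.02, 0.05, 0.09]
print(pearson(intrinsic, extrinsic))
```

In this framing, each embedding contributes one (intrinsic, extrinsic) pair, and the question the paper asks is whether the two scores move together once both are computed over word sets that express the same bias.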

Taxonomy

Topics: Natural Language Processing Techniques