Biased Embeddings from Wild Data: Measuring, Understanding and Removing

Adam Sutton; Thomas Lansdall-Welfare; Nello Cristianini

arXiv:1806.06301·cs.CL·June 19, 2018·1 cites

Biased Embeddings from Wild Data: Measuring, Understanding and Removing

Adam Sutton, Thomas Lansdall-Welfare, Nello Cristianini

PDF

Open Access

TL;DR

This paper introduces methods to measure, understand, and mitigate biases in NLP embeddings derived from real-world data, highlighting the connection between embedding bias and societal biases, and proposing a simple bias reduction technique.

Contribution

It provides a rigorous bias measurement approach, analyzes the reflection of societal biases in embeddings, and demonstrates an effective bias removal method.

Findings

01

Bias measurement correlates with social psychology word lists.

02

Gender bias in embeddings mirrors real-world occupational gender bias.

03

Simple projection reduces embedding bias significantly.

Abstract

Many modern Artificial Intelligence (AI) systems make use of data embeddings, particularly in the domain of Natural Language Processing (NLP). These embeddings are learnt from data that has been gathered "from the wild" and have been found to contain unwanted biases. In this paper we make three contributions towards measuring, understanding and removing this problem. We present a rigorous way to measure some of these biases, based on the use of word lists created for social psychology applications; we observe how gender bias in occupations reflects actual gender bias in the same occupations in the real world; and finally we demonstrate how a simple projection can significantly reduce the effects of embedding bias. All this is part of an ongoing effort to understand how trust can be built into AI systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning