Attenuating Bias in Word Vectors

Sunipa Dev; Jeff Phillips

arXiv:1901.07656·cs.CL·January 24, 2019·96 cites

Attenuating Bias in Word Vectors

Sunipa Dev, Jeff Phillips

PDF

Open Access 1 Repo

TL;DR

This paper presents simple methods to detect and reduce gender, race, ethnicity, and age biases in word embeddings, improving fairness in NLP applications.

Contribution

It introduces new techniques using names to identify and attenuate various biases in word vectors, extending beyond gender to other social biases.

Findings

01

Names effectively reveal gender bias in embeddings.

02

Bias attenuation methods reduce stereotypical associations.

03

Names can detect multiple social biases in word vectors.

Abstract

Word vector representations are well developed tools for various NLP and Machine Learning tasks and are known to retain significant semantic and syntactic structure of languages. But they are prone to carrying and amplifying bias which can perpetrate discrimination in various applications. In this work, we explore new simple ways to detect the most stereotypically gendered words in an embedding and remove the bias from them. We verify how names are masked carriers of gender bias and then use that as a tool to attenuate bias in embeddings. Further, we extend this property of names to show how names can be used to detect other types of bias in the embeddings such as bias based on race, ethnicity, and age.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sunipa/Attenuating-Bias-in-Word-Vec
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Authorship Attribution and Profiling · Interpreting and Communication in Healthcare