Gender-preserving Debiasing for Pre-trained Word Embeddings

Masahiro Kaneko; Danushka Bollegala

arXiv:1906.00742·cs.CL·June 4, 2019·21 cites

Gender-preserving Debiasing for Pre-trained Word Embeddings

Masahiro Kaneko, Danushka Bollegala

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel debiasing method for pre-trained word embeddings that effectively removes gender stereotypes while preserving genuine gender-related information, improving fairness in NLP applications.

Contribution

The proposed method uniquely preserves non-discriminative gender information and neutral words while removing stereotypes, outperforming existing debiasing techniques.

Findings

01

Outperforms state-of-the-art debiasing methods

02

Preserves essential gender-related information

03

Reduces stereotypical biases in word embeddings

Abstract

Word embeddings learnt from massive text collections have demonstrated significant levels of discriminative biases such as gender, racial or ethnic biases, which in turn bias the down-stream NLP applications that use those word embeddings. Taking gender-bias as a working example, we propose a debiasing method that preserves non-discriminative gender-related information, while removing stereotypical discriminative gender biases from pre-trained word embeddings. Specifically, we consider four types of information: \emph{feminine}, \emph{masculine}, \emph{gender-neutral} and \emph{stereotypical}, which represent the relationship between gender vs. bias, and propose a debiasing method that (a) preserves the gender-related information in feminine and masculine words, (b) preserves the neutrality in gender-neutral words, and (c) removes the biases from stereotypical words. Experimental…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kanekomasahiro/gp_debias
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification