Nurse is Closer to Woman than Surgeon? Mitigating Gender-Biased   Proximities in Word Embeddings

Vaibhav Kumar; Tenzin Singhay Bhotia; Vaibhav Kumar; Tanmoy; Chakraborty

arXiv:2006.01938·cs.CL·June 4, 2020

Nurse is Closer to Woman than Surgeon? Mitigating Gender-Biased Proximities in Word Embeddings

Vaibhav Kumar, Tenzin Singhay Bhotia, Vaibhav Kumar, Tanmoy, Chakraborty

PDF

1 Repo

TL;DR

This paper introduces RAN-Debias, a novel method for reducing gender bias in word embeddings by altering spatial distributions, and proposes GIPE as a new bias evaluation metric, showing significant bias reduction with minimal semantic change.

Contribution

The paper presents RAN-Debias, a new debiasing technique that effectively reduces gender bias in word embeddings while preserving semantic integrity, and introduces GIPE for bias measurement.

Findings

01

RAN-Debias reduces proximity bias by at least 42.02%.

02

It minimizes semantic disturbance compared to existing methods.

03

Achieves top performance in coreference resolution tasks.

Abstract

Word embeddings are the standard model for semantic and syntactic representations of words. Unfortunately, these models have been shown to exhibit undesirable word associations resulting from gender, racial, and religious biases. Existing post-processing methods for debiasing word embeddings are unable to mitigate gender bias hidden in the spatial arrangement of word vectors. In this paper, we propose RAN-Debias, a novel gender debiasing methodology which not only eliminates the bias present in a word vector but also alters the spatial distribution of its neighbouring vectors, achieving a bias-free setting while maintaining minimal semantic offset. We also propose a new bias evaluation metric - Gender-based Illicit Proximity Estimate (GIPE), which measures the extent of undue proximity in word vectors resulting from the presence of gender-based predilections. Experiments based on a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TimeTraveller-San/RAN-Debias
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.