Learning Gender-Neutral Word Embeddings

Jieyu Zhao; Yichao Zhou; Zeyu Li; Wei Wang; Kai-Wei Chang

arXiv:1809.01496·cs.CL·September 6, 2018·33 cites

Learning Gender-Neutral Word Embeddings

Jieyu Zhao, Yichao Zhou, Zeyu Li, Wei Wang, Kai-Wei Chang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel training method to create gender-neutral word embeddings that retain essential information while removing gender bias, improving fairness in NLP applications.

Contribution

The paper proposes a new training procedure for learning gender-neutral embeddings, exemplified by the GN-GloVe model, which isolates gender information without losing embedding utility.

Findings

01

GN-GloVe effectively isolates gender information.

02

The method preserves embedding functionality.

03

Gender bias is reduced in the resulting embeddings.

Abstract

Word embedding models have become a fundamental component in a wide range of Natural Language Processing (NLP) applications. However, embeddings trained on human-generated corpora have been demonstrated to inherit strong gender stereotypes that reflect social constructs. To address this concern, in this paper, we propose a novel training procedure for learning gender-neutral word embeddings. Our approach aims to preserve gender information in certain dimensions of word vectors while compelling other dimensions to be free of gender influence. Based on the proposed method, we generate a Gender-Neutral variant of GloVe (GN-GloVe). Quantitative and qualitative experiments demonstrate that GN-GloVe successfully isolates gender information without sacrificing the functionality of the embedding model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uclanlp/gn_glove
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Hate Speech and Cyberbullying Detection

MethodsGloVe Embeddings