Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation

Francisco Vargas; Ryan Cotterell

arXiv:2009.09435·cs.LG·May 24, 2024

Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation

Francisco Vargas, Ryan Cotterell

PDF

1 Repo

TL;DR

This paper investigates whether gender bias in word representations can be effectively isolated using linear methods, and introduces a nonlinear approach to test the linearity assumption, ultimately confirming the linear subspace hypothesis.

Contribution

It generalizes existing linear bias mitigation techniques to a nonlinear kernelized version and empirically verifies the linearity of gender bias in word embeddings.

Findings

01

Gender bias is well captured by a linear subspace.

02

The nonlinear method confirms the linearity assumption.

03

The approach improves bias mitigation techniques.

Abstract

Bolukbasi et al. (2016) presents one of the first gender bias mitigation techniques for word representations. Their method takes pre-trained word representations as input and attempts to isolate a linear subspace that captures most of the gender bias in the representations. As judged by an analogical evaluation task, their method virtually eliminates gender bias in the representations. However, an implicit and untested assumption of their method is that the bias subspace is actually linear. In this work, we generalize their method to a kernelized, nonlinear version. We take inspiration from kernel principal component analysis and derive a nonlinear bias isolation technique. We discuss and overcome some of the practical drawbacks of our method for non-linear gender bias mitigation in word representations and analyze empirically whether the bias subspace is actually linear. Our analysis…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

franciscovargas/Bias_space_study
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.