Evaluating Gender Bias in Natural Language Inference

Shanya Sharma; Manan Dey; Koustuv Sinha

arXiv:2105.05541 · cs.CL · May 13, 2021 · 6 cites

TL;DR

This paper introduces an evaluation method for detecting gender bias in natural language inference (NLI) models. It finds that BERT, RoBERTa, and BART trained on MNLI and SNLI make gender-stereotyped prediction errors on occupations, and that balancing the training data by gender reduces this bias in certain cases.

Contribution

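The paper proposes a challenge task for measuring gender bias in NLI models, pairing gender-neutral premises with gender-specific hypotheses. It uses the task to evaluate state-of-the-art models and to test whether gender-balanced data augmentation reduces the bias.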

Findings

01  BERT, RoBERTa, and BART show gender bias in their predictions.

02  Augmenting the training data to make it gender-balanced reduces the bias in certain cases.

03  Models trained on MNLI and SNLI are prone to gender stereotypes.
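
Finding 02 refers to gender-balanced training data. One common way to approximate this is gender-swap augmentation, sketched below. This is an illustrative assumption rather than the paper's documented procedure: the `SWAPS` table, `swap_gender`, and `augment` are hypothetical names, and the word list is deliberately tiny.

```python
import re

# Hypothetical, deliberately small swap table (an assumption for illustration).
# A real table would need many more entries and special handling for
# ambiguous words such as "her", which can map to "his" or "him".
SWAPS = {
    "he": "she", "she": "he",
    "man": "woman", "woman": "man",
    "men": "women", "women": "men",
    "boy": "girl", "girl": "boy",
}

# Match any swap word as a whole word, ignoring case.
_PATTERN = re.compile(
    r"\b(" + "|".join(map(re.escape, SWAPS)) + r")\b", re.IGNORECASE
)


def swap_gender(text):
    """Replace each gendered word with its counterpart, keeping capitalization."""
    def repl(match):
        word = match.group(0)
        swapped = SWAPS[word.lower()]
        return swapped.capitalize() if word[0].isupper() else swapped
    return _PATTERN.sub(repl, text)


def augment(examples):
    """Return the original examples plus a gender-swapped copy of each one
    that actually contains a gendered word. Labels are left unchanged."""
    augmented = list(examples)
    for ex in examples:
        swapped = {
            **ex,
            "premise": swap_gender(ex["premise"]),
            "hypothesis": swap_gender(ex["hypothesis"]),
        }
        if swapped != ex:
            augmented.append(swapped)
    return augmented
```

Calling `augment` on a list of `{"premise", "hypothesis", "label"}` dictionaries yields a training set in which every example containing one of the listed gendered words also appears with the opposite gender.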

Abstract

Gender-bias stereotypes have recently raised significant ethical concerns in natural language processing. However, progress in detection and evaluation of gender bias in natural language understanding through inference is limited and requires further investigation. In this work, we propose an evaluation methodology to measure these biases by constructing a challenge task that involves pairing gender-neutral premises against a gender-specific hypothesis. We use our challenge task to investigate state-of-the-art NLI models on the presence of gender stereotypes using occupations. Our findings suggest that three models (BERT, RoBERTa, BART) trained on MNLI and SNLI datasets are significantly prone to gender-induced prediction errors. We also find that debiasing techniques such as augmenting the training dataset to ensure a gender-balanced dataset can help reduce such bias in certain cases.
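
The challenge task in the abstract pairs a gender-neutral premise with gender-specific hypotheses built around occupations. The following is a minimal sketch of how such pairs could be generated. The templates, the occupation list, and the `entailment_gap` summary are all hypothetical choices made for illustration; they are not taken from the paper or its released data, and the gap is not the paper's metric.

```python
from itertools import product

# Hypothetical templates and word lists (assumptions for illustration only).
PREMISE_TEMPLATE = "The {occupation} is walking down the street."
HYPOTHESIS_TEMPLATE = "This text talks about a {gender} occupation."
OCCUPATIONS = ["nurse", "engineer", "teacher"]
GENDERS = ["female", "male"]


def build_challenge_pairs(occupations, genders):
    """Pair each gender-neutral premise with one hypothesis per gender."""
    pairs = []
    for occupation, gender in product(occupations, genders):
        pairs.append({
            "occupation": occupation,
            "gender": gender,
            "premise": PREMISE_TEMPLATE.format(occupation=occupation),
            "hypothesis": HYPOTHESIS_TEMPLATE.format(gender=gender),
        })
    return pairs


def entailment_gap(scores):
    """Per-occupation difference in entailment score (male minus female).

    `scores` maps (occupation, gender) to an entailment probability obtained
    from some NLI model. Because the premise carries no gender information,
    an unbiased model should give a gap close to zero.
    """
    occupations = sorted({occupation for occupation, _ in scores})
    return {
        occupation: scores[(occupation, "male")] - scores[(occupation, "female")]
        for occupation in occupations
    }
```

Each generated pair would be passed to an NLI model of the reader's choice. Since the premise is gender-neutral, a systematic preference for one gender's hypothesis on a given occupation is a sign of stereotyping.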

Code & Models

Repositories

shanyas10/Evaluating-gender-bias (Official)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

Methods: Attention Is All You Need · Linear Layer · Layer Normalization · Linear Warmup With Linear Decay · Softmax · Multi-Head Attention · Residual Connection · WordPiece · Weight Decay