A Generative Approach for Mitigating Structural Biases in Natural   Language Inference

Dimion Asael; Zachary Ziegler; Yonatan Belinkov

arXiv:2108.14006·cs.CL·September 1, 2021·1 cites

A Generative Approach for Mitigating Structural Biases in Natural Language Inference

Dimion Asael, Zachary Ziegler, Yonatan Belinkov

PDF

Open Access 1 Repo

TL;DR

This paper proposes a generative approach to mitigate structural biases in natural language inference, reformulating the task to generate missing input parts conditioned on biased features, leading to unbiased models that are more robust but initially harder to train.

Contribution

It introduces a novel generative framework for NLI that enforces unbiasedness through a uniform prior, and demonstrates how fine-tuning can balance bias reduction with model performance.

Findings

01

Generative models are highly robust to bias in synthetic experiments.

02

The approach produces fully unbiased models on real NLI datasets.

03

Fine-tuning reduces the performance gap with discriminative models.

Abstract

Many natural language inference (NLI) datasets contain biases that allow models to perform well by only using a biased subset of the input, without considering the remainder features. For instance, models are able to make a classification decision by only using the hypothesis, without learning the true relationship between it and the premise. These structural biases lead discriminative models to learn unintended superficial features and to generalize poorly out of the training distribution. In this work, we reformulate the NLI task as a generative task, where a model is conditioned on the biased subset of the input and the label and generates the remaining subset of the input. We show that by imposing a uniform prior, we obtain a provably unbiased model. Through synthetic experiments, we find that this approach is highly robust to large amounts of bias. We then demonstrate empirically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

technion-cs-nlp/generative-nli
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis