A Generative Approach for Mitigating Structural Biases in Natural Language Inference
Dimion Asael, Zachary Ziegler, Yonatan Belinkov

TL;DR
This paper proposes a generative approach to mitigate structural biases in natural language inference, reformulating the task to generate missing input parts conditioned on biased features, leading to unbiased models that are more robust but initially harder to train.
Contribution
It introduces a novel generative framework for NLI that enforces unbiasedness through a uniform prior, and demonstrates how fine-tuning can balance bias reduction with model performance.
Findings
Generative models are highly robust to bias in synthetic experiments.
The approach produces fully unbiased models on real NLI datasets.
Fine-tuning reduces the performance gap with discriminative models.
Abstract
Many natural language inference (NLI) datasets contain biases that allow models to perform well by only using a biased subset of the input, without considering the remainder features. For instance, models are able to make a classification decision by only using the hypothesis, without learning the true relationship between it and the premise. These structural biases lead discriminative models to learn unintended superficial features and to generalize poorly out of the training distribution. In this work, we reformulate the NLI task as a generative task, where a model is conditioned on the biased subset of the input and the label and generates the remaining subset of the input. We show that by imposing a uniform prior, we obtain a provably unbiased model. Through synthetic experiments, we find that this approach is highly robust to large amounts of bias. We then demonstrate empirically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
