Don't Discard All the Biased Instances: Investigating a Core Assumption   in Dataset Bias Mitigation Techniques

Hossein Amirkhani; Mohammad Taher Pilehvar

arXiv:2109.00521·cs.LG·September 3, 2021

Don't Discard All the Biased Instances: Investigating a Core Assumption in Dataset Bias Mitigation Techniques

Hossein Amirkhani, Mohammad Taher Pilehvar

PDF

1 Repo

TL;DR

This paper challenges the core assumption in dataset bias mitigation techniques that biased models predict the main model's behavior, showing that they often do not, which questions the effectiveness of current bias mitigation strategies.

Contribution

It critically investigates the assumption underlying bias mitigation methods, revealing that biased models often do not align with main models in their decision basis, and questions the practice of down-weighting biased instances.

Findings

01

Biased models and main models often rely on different input parts for decisions.

02

In 33-50% of instances, biased models fail to predict main model behavior.

03

Down-weighting biased instances may be an unnecessary waste of training data.

Abstract

Existing techniques for mitigating dataset bias often leverage a biased model to identify biased instances. The role of these biased instances is then reduced during the training of the main model to enhance its robustness to out-of-distribution data. A common core assumption of these techniques is that the main model handles biased instances similarly to the biased model, in that it will resort to biases whenever available. In this paper, we show that this assumption does not hold in general. We carry out a critical investigation on two well-known datasets in the domain, MNLI and FEVER, along with two biased instance detection methods, partial-input and limited-capacity models. Our experiments show that in around a third to a half of instances, the biased model is unable to predict the main model's behavior, highlighted by the significantly different parts of the input on which they…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

h-amirkhani/debiasing-assumption
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.