Learning to Split for Automatic Bias Detection
Yujia Bao, Regina Barzilay

TL;DR
This paper introduces Learning to Split (ls), an algorithm that automatically detects biases in datasets by creating challenging data splits, enabling better de-biasing and improving worst-group performance across various tasks.
Contribution
We propose a task-agnostic algorithm that automatically identifies biased data splits to facilitate bias detection and mitigation in supervised learning.
Findings
ls generates challenging splits that align with human-identified biases.
Combining ls with robust algorithms improves worst-group performance by 23.4%.
ls is applicable across diverse domains like NLP, image, and molecular data.
Abstract
Classifiers are biased when trained on biased datasets. As a remedy, we propose Learning to Split (ls), an algorithm for automatic bias detection. Given a dataset with input-label pairs, ls learns to split this dataset so that predictors trained on the training split cannot generalize to the testing split. This performance gap suggests that the testing split is under-represented in the dataset, which is a signal of potential bias. Identifying non-generalizable splits is challenging since we have no annotations about the bias. In this work, we show that the prediction correctness of each example in the testing split can be used as a source of weak supervision: generalization performance will drop if we move examples that are predicted correctly away from the testing split, leaving only those that are mis-predicted. ls is task-agnostic and can be applied to any supervised learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Text and Document Classification Technologies · Imbalanced Data Classification Techniques
