Are Bias Mitigation Techniques for Deep Learning Effective?
Robik Shrestha, Kushal Kafle, Christopher Kanan

TL;DR
This paper critically evaluates the effectiveness of bias mitigation techniques in deep learning, introducing new evaluation protocols, datasets, and metrics to assess robustness and exposing limitations of current methods.
Contribution
It presents a standardized evaluation framework, a new dataset Biased MNIST, and a comprehensive analysis of seven state-of-the-art bias mitigation algorithms.
Findings
Algorithms exploit hidden biases.
Limited scalability to multiple bias types.
High sensitivity to tuning set choices.
Abstract
A critical problem in deep learning is that systems learn inappropriate biases, resulting in their inability to perform well on minority groups. This has led to the creation of multiple algorithms that endeavor to mitigate bias. However, it is not clear how effective these methods are. This is because study protocols differ among papers, systems are tested on datasets that fail to test many forms of bias, and systems have access to hidden knowledge or are tuned specifically to the test set. To address this, we introduce an improved evaluation protocol, sensible metrics, and a new dataset, which enables us to ask and answer critical questions about bias mitigation algorithms. We evaluate seven state-of-the-art algorithms using the same network architecture and hyperparameter selection policy across three benchmark datasets. We introduce a new dataset called Biased MNIST that enables…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Adversarial Robustness in Machine Learning
