Distributionally Robust Optimization and Invariant Representation Learning for Addressing Subgroup Underrepresentation: Mechanisms and Limitations
Nilesh Kumar, Ruby Shrestha, Zhiyuan Li, Linwei Wang

TL;DR
This paper investigates mechanisms to mitigate bias caused by subgroup underrepresentation in medical image classification, proposing a novel robust optimization approach to improve invariant representation learning and reduce performance disparities.
Contribution
It introduces a new robust optimization method for invariant representation learning to address subgroup underrepresentation bias in deep neural networks.
Findings
Reweighting underperforming samples can be problematic when bias isn't the only issue.
Naive invariant representation learning can itself be biased by spurious correlations.
The proposed robust optimization approach improves subgroup fairness while maintaining overall performance.
Abstract
Spurious correlation caused by subgroup underrepresentation has received increasing attention as a source of bias that can be perpetuated by deep neural networks (DNNs). Distributionally robust optimization has shown success in addressing this bias, although the underlying working mechanism mostly relies on upweighting under-performing samples as surrogates for those underrepresented in data. At the same time, while invariant representation learning has been a powerful choice for removing nuisance-sensitive features, it has been little considered in settings where spurious correlations are caused by significant underrepresentation of subgroups. In this paper, we take the first step to better understand and improve the mechanisms for debiasing spurious correlation due to subgroup underrepresentation in medical image classification. Through a comprehensive evaluation study, we first show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Imbalanced Data Classification Techniques · Machine Learning and Data Classification
