TL;DR
This paper introduces FG-CCDB, a fine-grained distribution matching method for debiased learning, leveraging a multi-stage re-training strategy to better address spurious correlations without bias annotations.
Contribution
It proposes FG-CCDB, a novel fine-grained distribution matching technique that improves debiased learning by using confusion-cell-wise reweighting and a multi-stage re-training strategy.
Findings
FG-CCDB outperforms bias-supervised methods in multi-class and shortcut scenarios.
MST effectively approximates ground-truth bias annotations.
FG-CCDB achieves comparable results to bias-supervised approaches in binary classification.
Abstract
Achieving group-robust generalization in the presence of spurious correlations remains a significant challenge, particularly when bias annotations are unavailable. Recent studies on Class-Conditional Distribution Balancing (CCDB) reveal that spurious correlations often stem from mismatches between the class-conditional and marginal distributions of bias attributes. They achieve promising results by addressing this issue through simple distribution matching in a bias-agnostic manner. However, CCDB approximates each distribution using a single Gaussian, which is overly simplistic and rarely holds in real-world applications. To address this limitation, we propose a novel Multi-stage data-Selective reTraining strategy (MST), which describes each distribution in greater detail using the hard confusion matrix. Building on these finer descriptions, we propose a fine-grained variant of CCDB,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
