Delving into Identify-Emphasize Paradigm for Combating Unknown Bias
Bowen Zhao, Chen Chen, Qian-Wei Wang, Anfeng He, Shu-Tao Xia

TL;DR
This paper enhances the identify-emphasize paradigm for unknown bias mitigation by introducing a bias-conflicting scoring method, gradient alignment, and self-supervised learning, leading to improved model robustness and state-of-the-art results.
Contribution
It proposes novel techniques including ECS, gradient alignment, and self-supervised pretraining to address bias identification and emphasis challenges in unknown bias scenarios.
Findings
Improved bias-conflicting sample identification accuracy.
Dynamic balancing of bias-aligned and bias-conflicting contributions.
Achieved state-of-the-art performance on multiple datasets.
Abstract
Dataset biases are notoriously detrimental to model robustness and generalization. The identify-emphasize paradigm appears to be effective in dealing with unknown biases. However, we discover that it is still plagued by two challenges: A, the quality of the identified bias-conflicting samples is far from satisfactory; B, the emphasizing strategies only produce suboptimal performance. In this paper, for challenge A, we propose an effective bias-conflicting scoring method (ECS) to boost the identification accuracy, along with two practical strategies -- peer-picking and epoch-ensemble. For challenge B, we point out that the gradient contribution statistics can be a reliable indicator to inspect whether the optimization is dominated by bias-aligned samples. Then, we propose gradient alignment (GA), which employs gradient statistics to balance the contributions of the mined bias-aligned and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications
