Masked Images Are Counterfactual Samples for Robust Fine-tuning
Yao Xiao, Ziyi Tang, Pengxu Wei, Cong Liu, Liang Lin

TL;DR
This paper introduces a novel fine-tuning approach using masked images as counterfactual samples to enhance model robustness against distribution shifts, effectively balancing in-distribution accuracy and out-of-distribution robustness.
Contribution
The proposed method leverages masked images as counterfactuals for fine-tuning, explicitly addressing OOD robustness and improving the ID-OOD performance trade-off.
Findings
Outperforms previous methods on OOD robustness benchmarks
Improves the trade-off between in-distribution and out-of-distribution performance
Uses class activation maps to generate effective counterfactual samples
Abstract
Deep learning models are challenged by the distribution shift between the training data and test data. Recently, the large models pre-trained on diverse data have demonstrated unprecedented robustness to various distribution shifts. However, fine-tuning these models can lead to a trade-off between in-distribution (ID) performance and out-of-distribution (OOD) robustness. Existing methods for tackling this trade-off do not explicitly address the OOD robustness problem. In this paper, based on causal analysis of the aforementioned problems, we propose a novel fine-tuning method, which uses masked images as counterfactual samples that help improve the robustness of the fine-tuning model. Specifically, we mask either the semantics-related or semantics-unrelated patches of the images based on class activation map to break the spurious correlation, and refill the masked patches with patches…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsTest
