Self-training Avoids Using Spurious Features Under Domain Shift
Yining Chen, Colin Wei, Ananya Kumar, Tengyu Ma

TL;DR
This paper demonstrates that self-training with entropy minimization can effectively avoid spurious features under large domain shifts in unsupervised domain adaptation, especially when initialized with a good source classifier.
Contribution
The paper provides a theoretical analysis showing that entropy minimization can prevent reliance on spurious features under certain conditions, even with large domain shifts.
Findings
Entropy minimization avoids spurious features when initialized properly.
Theoretical guarantees hold for Gaussian spurious and log-concave non-spurious features.
Empirical validation on semi-synthetic Celeb-A and MNIST datasets.
Abstract
In unsupervised domain adaptation, existing theory focuses on situations where the source and target domains are close. In practice, conditional entropy minimization and pseudo-labeling work even when the domain shifts are much larger than those analyzed by existing theory. We identify and analyze one particular setting where the domain shift can be large, but these algorithms provably work: certain spurious features correlate with the label in the source domain but are independent of the label in the target. Our analysis considers linear classification where the spurious features are Gaussian and the non-spurious features are a mixture of log-concave distributions. For this setting, we prove that entropy minimization on unlabeled target data will avoid using the spurious feature if initialized with a decently accurate source classifier, even though the objective is non-convex and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Multimodal Machine Learning Applications
