Accuracy on the wrong line: On the pitfalls of noisy data for out-of-distribution generalisation
Amartya Sanyal, Yaxi Hu, Yaodong Yu, Yian Ma, Yixin Wang, Bernhard, Sch\"olkopf

TL;DR
This paper investigates how noisy data and nuisance features can cause the positive correlation between in-distribution and out-of-distribution accuracy to break down, leading to negative correlation and increased OOD errors, even in larger datasets.
Contribution
It formally analyzes the conditions under which accuracy on in-distribution and out-of-distribution data become negatively correlated due to noise and nuisance features, and demonstrates this phenomenon empirically.
Findings
Noisy data can cause the accuracy correlation to reverse.
Nuisance features can overshadow core features, worsening OOD performance.
Scaling datasets does not necessarily improve robustness to noise.
Abstract
"Accuracy-on-the-line" is a widely observed phenomenon in machine learning, where a model's accuracy on in-distribution (ID) and out-of-distribution (OOD) data is positively correlated across different hyperparameters and data configurations. But when does this useful relationship break down? In this work, we explore its robustness. The key observation is that noisy data and the presence of nuisance features can be sufficient to shatter the Accuracy-on-the-line phenomenon. In these cases, ID and OOD accuracy can become negatively correlated, leading to "Accuracy-on-the-wrong-line". This phenomenon can also occur in the presence of spurious (shortcut) features, which tend to overshadow the more complex signal (core, non-spurious) features, resulting in a large nuisance feature space. Moreover, scaling to larger datasets does not mitigate this undesirable behavior and may even exacerbate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Process Monitoring · Image and Signal Denoising Methods
