Predicting with Confidence on Unseen Distributions

Devin Guillory; Vaishaal Shankar; Sayna Ebrahimi; Trevor Darrell,; Ludwig Schmidt

arXiv:2107.03315·cs.LG·August 23, 2021

Predicting with Confidence on Unseen Distributions

Devin Guillory, Vaishaal Shankar, Sayna Ebrahimi, Trevor Darrell,, Ludwig Schmidt

PDF

TL;DR

This paper introduces a simple yet effective method called Difference of Confidences (DoC) to predict model performance on unseen data distributions, significantly improving accuracy estimation under distribution shifts.

Contribution

The work demonstrates that DoC reliably estimates performance change under distribution shift, outperforming traditional distributional distances and enabling better performance prediction without labeled data.

Findings

01

DoC reduces predictive error by 46% on challenging datasets.

02

Traditional distributional distances fail to reliably estimate performance under shift.

03

DoC outperforms other measures in both synthetic and natural distribution shifts.

Abstract

Recent work has shown that the performance of machine learning models can vary substantially when models are evaluated on data drawn from a distribution that is close to but different from the training distribution. As a result, predicting model performance on unseen distributions is an important challenge. Our work connects techniques from domain adaptation and predictive uncertainty literature, and allows us to predict model accuracy on challenging unseen distributions without access to labeled data. In the context of distribution shift, distributional distances are often used to adapt models and improve their performance on new domains, however accuracy estimation, or other forms of predictive uncertainty, are often neglected in these investigations. Through investigating a wide range of established distributional distances, such as Frechet distance or Maximum Mean Discrepancy, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.