Predicting is not Understanding: Recognizing and Addressing Underspecification in Machine Learning
Damien Teney, Maxime Peyrard, Ehsan Abbasnejad

TL;DR
This paper formalizes underspecification in machine learning, demonstrating how multiple models with similar in-domain accuracy can differ in out-of-distribution performance, and proposes a method to identify and improve models' OOD robustness.
Contribution
It introduces a formal framework for underspecification and a novel method to discover and incorporate meaningful features that enhance OOD performance.
Findings
Models with different functions can have similar in-domain accuracy.
Discovered features improve out-of-distribution generalization.
In-domain accuracy alone is insufficient for reliable model selection.
Abstract
Machine learning (ML) models are typically optimized for their accuracy on a given dataset. However, this predictive criterion rarely captures all desirable properties of a model, in particular how well it matches a domain expert's understanding of a task. Underspecification refers to the existence of multiple models that are indistinguishable in their in-domain accuracy, even though they differ in other desirable properties such as out-of-distribution (OOD) performance. Identifying these situations is critical for assessing the reliability of ML models. We formalize the concept of underspecification and propose a method to identify and partially address it. We train multiple models with an independence constraint that forces them to implement different functions. They discover predictive features that are otherwise ignored by standard empirical risk minimization (ERM), which we then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
MethodsALIGN
