Predicting is not Understanding: Recognizing and Addressing   Underspecification in Machine Learning

Damien Teney; Maxime Peyrard; Ehsan Abbasnejad

arXiv:2207.02598·cs.LG·July 7, 2022

Predicting is not Understanding: Recognizing and Addressing Underspecification in Machine Learning

Damien Teney, Maxime Peyrard, Ehsan Abbasnejad

PDF

Open Access

TL;DR

This paper formalizes underspecification in machine learning, demonstrating how multiple models with similar in-domain accuracy can differ in out-of-distribution performance, and proposes a method to identify and improve models' OOD robustness.

Contribution

It introduces a formal framework for underspecification and a novel method to discover and incorporate meaningful features that enhance OOD performance.

Findings

01

Models with different functions can have similar in-domain accuracy.

02

Discovered features improve out-of-distribution generalization.

03

In-domain accuracy alone is insufficient for reliable model selection.

Abstract

Machine learning (ML) models are typically optimized for their accuracy on a given dataset. However, this predictive criterion rarely captures all desirable properties of a model, in particular how well it matches a domain expert's understanding of a task. Underspecification refers to the existence of multiple models that are indistinguishable in their in-domain accuracy, even though they differ in other desirable properties such as out-of-distribution (OOD) performance. Identifying these situations is critical for assessing the reliability of ML models. We formalize the concept of underspecification and propose a method to identify and partially address it. We train multiple models with an independence constraint that forces them to implement different functions. They discover predictive features that are otherwise ignored by standard empirical risk minimization (ERM), which we then…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning

MethodsALIGN