Demystifying Disagreement-on-the-Line in High Dimensions

Donghwan Lee; Behrad Moniri; Xinmeng Huang; Edgar Dobriban; Hamed; Hassani

arXiv:2301.13371·stat.ML·March 2, 2023

Demystifying Disagreement-on-the-Line in High Dimensions

Donghwan Lee, Behrad Moniri, Xinmeng Huang, Edgar Dobriban, Hamed, Hassani

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper provides a theoretical analysis of disagreement-on-the-line phenomenon in high-dimensional models, linking disagreement and error across domains, supported by experiments on multiple datasets.

Contribution

It develops a theoretical framework for understanding disagreement in high-dimensional regression and identifies conditions for the disagreement-on-the-line phenomenon.

Findings

01

Disagreement correlates with prediction error in high dimensions.

02

The disagreement-on-the-line phenomenon occurs under specific conditions.

03

Experimental results align with theoretical predictions across datasets.

Abstract

Evaluating the performance of machine learning models under distribution shift is challenging, especially when we only have unlabeled data from the shifted (target) domain, along with labeled data from the original (source) domain. Recent work suggests that the notion of disagreement, the degree to which two models trained with different randomness differ on the same input, is a key to tackle this problem. Experimentally, disagreement and prediction error have been shown to be strongly connected, which has been used to estimate model performance. Experiments have led to the discovery of the disagreement-on-the-line phenomenon, whereby the classification error under the target domain is often a linear function of the classification error under the source domain; and whenever this property holds, disagreement under the source and target domain follow the same linear relation. In this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dh7401/rf-disagreement
noneOfficial

Videos

Demystifying Disagreement-on-the-Line in High Dimensions· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Statistical Methods and Inference · Adversarial Robustness in Machine Learning