Domain Adaptation under Missingness Shift

Helen Zhou; Sivaraman Balakrishnan; Zachary C. Lipton

arXiv:2211.02093·cs.LG·May 5, 2023·1 cites

Domain Adaptation under Missingness Shift

Helen Zhou, Sivaraman Balakrishnan, Zachary C. Lipton

PDF

Open Access 1 Repo

TL;DR

This paper introduces the problem of domain adaptation under missingness shift, analyzing how missing data mechanisms affect transferability and proposing methods for effective adaptation even with incomplete data.

Contribution

It formalizes DAMS, provides theoretical insights under missingness at random, and proposes an analytic adjustment for linear models to improve domain adaptation.

Findings

01

Covariate shift is violated without missingness indicators.

02

Optimal source predictor can perform arbitrarily worse on target.

03

Analytic adjustment yields consistent target parameter estimates.

Abstract

Rates of missing data often depend on record-keeping policies and thus may change across times and locations, even when the underlying features are comparatively stable. In this paper, we introduce the problem of Domain Adaptation under Missingness Shift (DAMS). Here, (labeled) source data and (unlabeled) target data would be exchangeable but for different missing data mechanisms. We show that if missing data indicators are available, DAMS reduces to covariate shift. Addressing cases where such indicators are absent, we establish the following theoretical results for underreporting completely at random: (i) covariate shift is violated (adaptation is required); (ii) the optimal linear source predictor can perform arbitrarily worse on the target domain than always predicting the mean; (iii) the optimal target predictor can be identified, even when the missingness rates themselves are not;…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

acmi-lab/missingness-shift
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Cancer-related molecular mechanisms research