Efficient Inference under Label Shift in Unsupervised Domain Adaptation
Seong-ho Lee, Yanyuan Ma, Jiwei Zhao

TL;DR
This paper introduces a novel three-stage estimation method for efficient inference under label shift in unsupervised domain adaptation, addressing distribution differences between source and target data.
Contribution
It proposes a progressive estimation strategy for label shift, connecting it with prediction-powered inference, and establishes its asymptotic properties with demonstrated practical advantages.
Findings
The method outperforms existing approaches in simulations.
The approach is effective in real-world applications.
The three-stage estimation process is theoretically sound.
Abstract
In many real-world applications, researchers aim to deploy models trained in a source domain to a target domain, where obtaining labeled data is often expensive, time-consuming, or even infeasible. While most existing literature assumes that the labeled source data and the unlabeled target data follow the same distribution, distribution shifts are common in practice. This paper focuses on label shift and develops efficient inference procedures for general parameters characterizing the unlabeled target population. A central idea is to model the outcome density ratio between the labeled and unlabeled data. To this end, we propose a progressive estimation strategy that unfolds in three stages: an initial heuristic guess, a consistent estimation, and ultimately, an efficient estimation. This self-evolving process is novel in the statistical literature and of independent interest. We also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
