Estimate Level Adjustment For Inference With Proxies Under Random Distribution Shifts
Steven Wilkins-Reeves, Alexandra N. M. Darmon, Deeksha Sinha

TL;DR
This paper proposes a new empirical calibration method for proxy-based inference under distribution shifts, modeling the discrepancy as a random effect and leveraging historical data to improve accuracy.
Contribution
It introduces an estimate-level framework inspired by domain adaptation that calibrates proxy inference without needing individual response data, enhancing existing correction methods.
Findings
The method effectively calibrates proxy inference across multiple domains.
It improves bias correction when distribution shifts occur.
Validation on real datasets demonstrates practical utility.
Abstract
In many scientific domains, including experimentation, researchers rely on measurements of proxy outcomes to achieve faster and more frequent reads, especially when the primary outcome of interest is challenging to measure directly. While proxies offer a more readily accessible observation for inference, the ultimate goal is to draw statistical inferences about the primary outcome parameter and proxy data are typically imperfect in some ways. To correct for these imperfections, current statistical inference methods often depend on strict identifying assumptions (such as surrogacy, covariate/label shift, or missingness assumptions). These assumptions can be difficult to validate and may be violated by various additional sources of distribution shift, potentially leading to biased parameter estimates and miscalibrated uncertainty quantification. We introduce an estimate-level framework,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
