Multi-Source Conformal Inference Under Distribution Shift
Yi Liu, Alexander W. Levis, Sharon-Lise Normand, Larry Han

TL;DR
This paper develops a distribution-free conformal inference method for multi-source data under distribution shifts, enabling valid prediction intervals while addressing bias, privacy, and heterogeneity issues.
Contribution
It introduces a novel approach to obtain valid prediction intervals in multi-source environments with biased data and distribution shifts, incorporating machine learning and adaptive weighting.
Findings
Method achieves nominal coverage probabilities in synthetic experiments.
Proposed approach improves efficiency by upweighting informative sources.
Application to hospital data demonstrates practical utility.
Abstract
Recent years have experienced increasing utilization of complex machine learning models across multiple sources of data to inform more generalizable decision-making. However, distribution shifts across data sources and privacy concerns related to sharing individual-level data, coupled with a lack of uncertainty quantification from machine learning predictions, make it challenging to achieve valid inferences in multi-source environments. In this paper, we consider the problem of obtaining distribution-free prediction intervals for a target population, leveraging multiple potentially biased data sources. We derive the efficient influence functions for the quantiles of unobserved outcomes in the target and source populations, and show that one can incorporate machine learning prediction algorithms in the estimation of nuisance functions while still achieving parametric rates of convergence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Advanced Causal Inference Techniques · Statistical Methods and Bayesian Inference
