Minimax Rates and Adaptivity in Combining Experimental and Observational Data
Shuxiao Chen, Bo Zhang, Ting Ye

TL;DR
This paper explores how integrating observational data with randomized controlled trials can improve effect estimation efficiency, revealing that adaptation is possible for estimation but impossible for inference due to confounding bias.
Contribution
It provides the first theoretical minimax analysis of data integration from RCTs and observational studies, proposing an adaptive estimator and highlighting fundamental limits for inference.
Findings
Adaptive estimator attains optimal convergence rate for estimation.
Efficiency gain from data integration is achievable without knowing confounding bias.
Adaptation for confidence interval length is generally impossible due to confounding.
Abstract
Randomized controlled trials (RCTs) are the gold standard for evaluating the causal effect of a treatment; however, they often have limited sample sizes and sometimes poor generalizability. On the other hand, non-randomized, observational data derived from large administrative databases have massive sample sizes and better generalizability, but they are prone to unmeasured confounding bias. It is thus of considerable interest to reconcile effect estimates obtained from randomized controlled trials and observational studies investigating the same intervention, potentially harvesting the best from both realms. In this paper, we theoretically characterize the potential efficiency gain of integrating observational data into the RCT-based analysis from a minimax point of view. For estimation, we derive the minimax rate of convergence for the mean squared error, and propose a fully adaptive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
