Designing Experiments Informed by Observational Studies
Evan Rosenman, Art B. Owen

TL;DR
This paper introduces a novel method for designing more efficient experiments by using observational data to inform the experimental setup, specifically focusing on stratified experiments with binary outcomes.
Contribution
It presents a new approach that leverages observational data to optimize experimental design, avoiding the bias-variance tradeoff typical in data fusion methods.
Findings
Method effectively incorporates observational data into experimental design.
Convex optimization approach simplifies the design problem.
Demonstrated utility with Women's Health Initiative data.
Abstract
The increasing availability of passively observed data has yielded a growing methodological interest in "data fusion." These methods involve merging data from observational and experimental sources to draw causal conclusions -- and they typically require a precarious tradeoff between the unknown bias in the observational dataset and the often-large variance in the experimental dataset. We propose an alternative approach to leveraging observational data, which avoids this tradeoff: rather than using observational data for inference, we use it to design a more efficient experiment. We consider the case of a stratified experiment with a binary outcome, and suppose pilot estimates for the stratum potential outcome variances can be obtained from the observational study. We extend results from Zhao et al. (2019) in order to generate confidence sets for these variances, while accounting for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
