Bridging observational studies and randomized experiments by embedding the former in the latter
Marie-Abele C. Bind, Donald B. Rubin

TL;DR
This paper introduces a four-stage strategy to estimate causal effects of environmental exposures from observational data by structuring it to resemble a randomized experiment, enhancing causal inference in health studies.
Contribution
It presents a novel framework combining conceptual, design, analysis, and summary stages to approximate randomized experiments from observational data, with practical diagnostics.
Findings
Applied to parental smoking and children's lung function in 1970s East Boston
Demonstrated the approach's ability to quantify exposure effects
Provided diagnostics to validate assumptions of the causal inference method
Abstract
The health effects of environmental exposures have been studied for decades, typically using standard regression models to assess exposure-outcome associations found in observational non-experimental data. We propose and illustrate a different approach to examine causal effects of environmental exposures on health outcomes from observational data. Our strategy attempts to structure the observational data to approximate data from a hypothetical, but realistic, randomized experiment. This approach, based on insights from classical experimental design, involves four stages, and relies on modern computing to implement the effort in two of the four stages.More specifically, our strategy involves: 1) a conceptual stage that involves the precise formulation of the causal question in terms of a hypothetical randomized experiment where the exposure is assigned to units; 2) a design stage that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
