TL;DR
This paper introduces a Bayesian method to estimate population average causal effects in non-overlap regions, combining data-driven overlap definitions with flexible modeling to improve causal inference in environmental health studies.
Contribution
It proposes a new data-driven definition of overlap and a Bayesian framework that separates modeling in overlap and non-overlap regions, enhancing causal effect estimation with minimal assumptions.
Findings
Method performs well in simulations
Applied to natural gas station exposure and cancer mortality
Provides more reliable population causal estimates
Abstract
Most causal inference studies rely on the assumption of overlap to estimate population or sample average causal effects. When data exhibit non-overlap, estimation of these estimands requires reliance on model specifications, due to poor data support. All existing methods to address non-overlap, such as trimming or down-weighting data in regions of poor support, change the estimand. In environmental health research, where study results are often intended to influence policy, changes in the estimand can diminish the study's impact, because estimates may not be representative of effects in the population of interest to policymakers. Researchers may be willing to make additional, minimal modeling assumptions in order to preserve the ability to estimate population average causal effects. We seek to make two contributions on this topic. First, we propose a flexible, data-driven definition of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
