Causal Inference with Double/Debiased Machine Learning for Evaluating the Health Effects of Multiple Mismeasured Pollutants
Gang Xu, Xin Zhou, Molin Wang, Boya Zhang, Wenhao Jiang, Francine, Laden, Helen H. Suh, Adam A. Szpiro, Donna Spiegelman, Zuoheng Wang

TL;DR
This paper develops a double/debiased machine learning approach combined with regression calibration to accurately estimate the causal effects of specific air pollution constituents on health outcomes, accounting for measurement error and correlations.
Contribution
It introduces a novel method integrating DML with measurement error correction for causal inference in multi-pollutant studies, validated through simulations and real data application.
Findings
Identified two PM2.5 constituents with negative causal effects on cognitive function.
Demonstrated the estimator reduces bias and achieves nominal coverage in simulations.
Applied method to real data, revealing specific pollutants' health impacts.
Abstract
One way to quantify exposure to air pollution and its constituents in epidemiologic studies is to use an individual's nearest monitor. This strategy results in potential inaccuracy in the actual personal exposure, introducing bias in estimating the health effects of air pollution and its constituents, especially when evaluating the causal effects of correlated multi-pollutant constituents measured with correlated error. This paper addresses estimation and inference for the causal effect of one constituent in the presence of other PM2.5 constituents, accounting for measurement error and correlations. We used a linear regression calibration model, fitted with generalized estimating equations in an external validation study, and extended a double/debiased machine learning (DML) approach to correct for measurement error and estimate the effect of interest in the main study. We demonstrated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWater Quality and Pollution Assessment
MethodsLinear Regression
