TL;DR
This paper introduces a scalable land-use regression model with penalization and Gaussian process approximation to accurately predict daily ground-level NO$_2$ concentrations across the US, aiding health risk assessments.
Contribution
It develops a novel scalable method combining variable selection, spatiotemporal modeling, and Gaussian process approximation for land-use regression of air pollution data.
Findings
Improved model selection sensitivity and specificity.
Better prediction accuracy and interpretability.
Revealed spatiotemporal NO$_2$ patterns across the US.
Abstract
Nitrogen dioxide (NO) is a primary constituent of traffic-related air pollution and has well established harmful environmental and human-health impacts. Knowledge of the spatiotemporal distribution of NO is critical for exposure and risk assessment. A common approach for assessing air pollution exposure is linear regression involving spatially referenced covariates, known as land-use regression (LUR). We develop a scalable approach for simultaneous variable selection and estimation of LUR models with spatiotemporally correlated errors, by combining a general-Vecchia Gaussian process approximation with a penalty on the LUR coefficients. In comparisons to existing methods using simulated data, our approach resulted in higher model-selection specificity and sensitivity and in better prediction in terms of calibration and sharpness, for a wide range of relevant settings. In our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
