Variable importance measure for spatial machine learning models with application to air pollution exposure prediction
Si Cheng, Magali N. Blanco, Lianne Sheppard, Ali Shojaie, Adam Szpiro

TL;DR
This paper introduces a leave-one-out variable importance method for spatial machine learning models, enhancing interpretability and comparison in air pollution exposure prediction, with applications to sulfur and ultrafine particles datasets.
Contribution
The paper presents a novel variable importance approach for spatial models that separates mean and covariance effects, improving interpretability in air pollution exposure modeling.
Findings
The method provides interpretable importance measures for various spatial models.
It reveals differences in model mechanisms despite similar prediction accuracy.
Application to sulfur and UFP datasets demonstrates practical utility.
Abstract
Exposure assessment is fundamental to air pollution cohort studies. The objective is to predict air pollution exposures for study subjects at locations without data in order to optimize our ability to learn about health effects of air pollution. In addition to generating accurate predictions to minimize exposure measurement error, understanding the mechanism captured by the model is another crucial aspect that may not always be straightforward due to the complex nature of machine learning methods, as well as the lack of unifying notions of variable importance. This is further complicated in air pollution modeling by the presence of spatial correlation. We tackle these challenges in two datasets: sulfur (S) from regulatory United States national PM2.5 sub-species data and ultrafine particles (UFP) from a new Seattle-area traffic-related air pollution dataset. Our key contribution is a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAir Quality and Health Impacts
