A novel principal component analysis for spatially-misaligned multivariate air pollution data
Roman A. Jandarov, Lianne A. Sheppard, Paul D. Sampson, Adam A. Szpiro

TL;DR
This paper introduces a new predictive sparse PCA method tailored for spatially-misaligned air pollution data, enabling better identification of pollutant mixtures and their health impacts in cohort studies.
Contribution
The paper presents a novel predictive sparse PCA approach that handles spatial misalignment, improving pollutant mixture analysis and health effect quantification.
Findings
Effective in simulated data scenarios
Successfully applied to EPA particulate matter data
Enhances understanding of pollutant health impacts
Abstract
We propose novel methods for predictive (sparse) PCA with spatially misaligned data. These methods identify principal component loading vectors that explain as much variability in the observed data as possible, while also ensuring the corresponding principal component scores can be predicted accurately by means of spatial statistics at locations where air pollution measurements are not available. This will make it possible to identify important mixtures of air pollutants and to quantify their health effects in cohort studies, where currently available methods cannot be used. We demonstrate the utility of predictive (sparse) PCA in simulated data and apply the approach to annual averages of particulate matter speciation data from national Environmental Protection Agency (EPA) regulatory monitors.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAir Quality and Health Impacts · Economic and Environmental Valuation · Urban Transport and Accessibility
