Principal Component Pursuit for Pattern Identification in Environmental Mixtures
Elizabeth A. Gibson (1), Junhui Zhang (2), Jingkai Yan (2), Lawrence, Chillrud (1), Jaime Benavides (1), Yanelli Nunez (1), Julie B. Herbstman (1),, Jeff Goldsmith (3), John Wright (2), and Marianthi-Anna Kioumourtzoglou (1), ((1) Department of Environmental Health Sciences

TL;DR
This paper adapts principal component pursuit (PCP) for environmental mixture analysis, effectively identifying exposure patterns even with missing or below detection limit data, outperforming PCA in simulations and real data.
Contribution
The study introduces PCP-LOD, a novel adaptation of PCP for environmental data, handling non-negative, missing, and below detection limit values, improving pattern detection over PCA.
Findings
PCP-LOD accurately recovered true patterns in simulations.
PCP-LOD outperformed PCA in predictive error with high <LOD data.
Identified meaningful exposure patterns in real POP data.
Abstract
Environmental health researchers often aim to identify sources/behaviors that give rise to potentially harmful exposures. We adapted principal component pursuit (PCP)-a robust technique for dimensionality reduction in computer vision and signal processing-to identify patterns in environmental mixtures. PCP decomposes the exposure mixture into a low-rank matrix containing consistent exposure patterns across pollutants and a sparse matrix isolating unique exposure events. We adapted PCP to accommodate non-negative and missing data, and values below a given limit of detection (LOD). We simulated data to represent environmental mixtures of two sizes with increasing proportions <LOD and three noise structures. We compared PCP-LOD to principal component analysis (PCA) to evaluate performance. We next applied PCP-LOD to a mixture of 21 persistent organic pollutants (POPs) measured in 1,000…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHealth, Environment, Cognitive Aging · Metabolomics and Mass Spectrometry Studies
