A general theory for preferential sampling in environmental networks
Joe Watson, James V. Zidek, Gavin Shaddick

TL;DR
This paper introduces a comprehensive model framework to detect and analyze preferential sampling in environmental monitoring networks, accounting for spatial, temporal, and dynamic site-selection processes, with practical applications to air pollution data in the UK.
Contribution
The paper develops a flexible, joint modeling framework for preferential sampling that can be easily implemented using R-INLA, incorporating dynamic site-selection processes and real-world factors.
Findings
Significant response-biased reduction in monitoring network occurred.
Network was unrepresentative of particulate matter levels across GB.
Potential overestimation of population-average exposure levels.
Abstract
This paper presents a general model framework for detecting the preferential sampling of environmental monitors recording an environmental process across space and/or time. This is achieved by considering the joint distribution of an environmental process with a site--selection process that considers where and when sites are placed to measure the process. The environmental process may be spatial, temporal or spatio--temporal in nature. By sharing random effects between the two processes, the joint model is able to establish whether site placement was stochastically dependent of the environmental process under study. The embedding into a spatio--temporal framework also allows for the modelling of the dynamic site---selection process itself. Real--world factors affecting both the size and location of the network can be easily modelled and quantified. Depending upon the choice of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
