Aligning Validation with Deployment in Spatial Prediction: Target-Weighted Cross-Validation
Alexander Brenning, Thomas Suesse

TL;DR
This paper proposes a deployment-oriented validation framework using weighted cross-validation to improve performance estimates in spatial environmental modeling, especially under biased sampling conditions.
Contribution
It introduces Target-Weighted Cross-Validation (TWCV), a novel validation approach that aligns validation tasks with deployment prediction conditions using spatially meaningful descriptors.
Findings
Weighted CV reduces bias in performance estimates under biased sampling.
Standard CV can overestimate prediction error due to sampling bias.
TWCV provides more accurate performance assessment aligned with deployment conditions.
Abstract
Reliable estimation of predictive performance is essential for spatial environmental modeling, where machine-learning models are used to generate maps from unevenly distributed observations. Standard cross-validation (CV) assumes that validation data are representative of prediction conditions across the target domain. In practice, this assumption is often violated due to preferential or clustered sampling, leading to biased performance and uncertainty estimates. We introduce a deployment-oriented validation framework based on weighted CV that aligns validation tasks with the distribution of prediction tasks across a specified domain. The framework includes importance-weighted cross-validation (IWCV) and a calibration-based approach, Target-Weighted Cross-Validation (TWCV), which uses spatially meaningful task descriptors such as environmental covariates and prediction distance.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
