TL;DR
This paper discusses the unique challenges of applying statistical learning in geospatial contexts, highlighting issues like spatial correlation and covariate shifts, and evaluates existing error estimation methods which are often inadequate.
Contribution
It introduces the geostatistical learning problem, assesses current error estimation methods in geospatial settings, and offers practical guidelines amidst ongoing research for better solutions.
Findings
Existing error estimation methods are inadequate for geospatial data.
Spatial correlation and covariate shift significantly impact model generalization.
Guidelines are provided for model selection in geospatial machine learning.
Abstract
Statistical learning theory provides the foundation to applied machine learning, and its various successful applications in computer vision, natural language processing and other scientific domains. The theory, however, does not take into account the unique challenges of performing statistical learning in geospatial settings. For instance, it is well known that model errors cannot be assumed to be independent and identically distributed in geospatial (a.k.a. regionalized) variables due to spatial correlation; and trends caused by geophysical processes lead to covariate shifts between the domain where the model was trained and the domain where it will be applied, which in turn harm the use of classical learning methodologies that rely on random samples of the data. In this work, we introduce the geostatistical (transfer) learning problem, and illustrate the challenges of learning from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGaussian Process
