Geostatistical Learning: Challenges and Opportunities

J\'ulio Hoffimann; Maciel Zortea; Breno de Carvalho; Bianca Zadrozny

arXiv:2102.08791·stat.ML·February 18, 2021

Geostatistical Learning: Challenges and Opportunities

J\'ulio Hoffimann, Maciel Zortea, Breno de Carvalho, Bianca Zadrozny

PDF

1 Repo

TL;DR

This paper discusses the unique challenges of applying statistical learning in geospatial contexts, highlighting issues like spatial correlation and covariate shifts, and evaluates existing error estimation methods which are often inadequate.

Contribution

It introduces the geostatistical learning problem, assesses current error estimation methods in geospatial settings, and offers practical guidelines amidst ongoing research for better solutions.

Findings

01

Existing error estimation methods are inadequate for geospatial data.

02

Spatial correlation and covariate shift significantly impact model generalization.

03

Guidelines are provided for model selection in geospatial machine learning.

Abstract

Statistical learning theory provides the foundation to applied machine learning, and its various successful applications in computer vision, natural language processing and other scientific domains. The theory, however, does not take into account the unique challenges of performing statistical learning in geospatial settings. For instance, it is well known that model errors cannot be assumed to be independent and identically distributed in geospatial (a.k.a. regionalized) variables due to spatial correlation; and trends caused by geophysical processes lead to covariate shifts between the domain where the model was trained and the domain where it will be applied, which in turn harm the use of classical learning methodologies that rely on random samples of the data. In this work, we introduce the geostatistical (transfer) learning problem, and illustrate the challenges of learning from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IBM/geostats-gen-error
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGaussian Process