Graph-Based Spatial Segmentation of Health-Related Areal Data
Vivien Goepp, Jan van de Kassteele

TL;DR
This paper introduces a fast, scalable method for piecewise-constant spatial estimation on large graphs, improving interpretability and identifying health zones in areal data.
Contribution
The authors propose a new graph-based segmentation method that is computationally efficient and produces sparser, more interpretable models compared to existing approaches like the fused lasso.
Findings
Successfully applied to real health data in the Netherlands
Identified health zones not explained by demographic factors
Demonstrated improved scalability and interpretability
Abstract
Smoothing is often used to improve the readability and interpretability of noisy areal data. However there are many instances where the underlying quantity is discontinuous. In this case, specific methods are needed to estimate the piecewise constant spatial process. A well-known approach in this setting is to perform segmentation of the signal using the adjacency graph, as does the graph-based fused lasso. But this method does not scale well to large graphs. This article introduces a new method for piecewise-constant spatial estimation that (i) is fast to compute on large graphs and (ii) yields sparser models than the fused lasso (for the same amount of regularization), giving estimates that are easier to interpret. We illustrate our method on simulated data and apply it to real data on overweight prevalence in the Netherlands. Healthy and unhealthy zones are identified which cannot be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData-Driven Disease Surveillance · Spatial and Panel Data Analysis · Point processes and geometric inequalities
