ClustGeo: an R package for hierarchical clustering with spatial constraints
Marie Chavent, Vanessa Kuentz-Simonet, Amaury Labenne and, J\'er\^ome Saracco

TL;DR
ClustGeo introduces a hierarchical clustering method that integrates spatial constraints by combining feature dissimilarities with geographical information, enhancing clustering quality with spatial contiguity.
Contribution
The paper presents a novel Ward-like hierarchical clustering algorithm that incorporates spatial constraints through a convex combination of dissimilarity matrices, adaptable to non-Euclidean data and non-uniform weights.
Findings
Effective integration of spatial constraints improves cluster contiguity.
Flexible dissimilarity measures accommodate various data types.
Application demonstrates practical utility on real spatial datasets.
Abstract
In this paper, we propose a Ward-like hierarchical clustering algorithm including spatial/geographical constraints. Two dissimilarity matrices and are inputted, along with a mixing parameter . The dissimilarities can be non-Euclidean and the weights of the observations can be non-uniform. The first matrix gives the dissimilarities in the "feature space" and the second matrix gives the dissimilarities in the "constraint space". The criterion minimized at each stage is a convex combination of the homogeneity criterion calculated with and the homogeneity criterion calculated with . The idea is then to determine a value of which increases the spatial contiguity without deteriorating too much the quality of the solution based on the variables of interest i.e. those of the feature space. This procedure is illustrated on a real dataset using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
