Spatial regionalization based on optimal information compression
Alec Kirkley

TL;DR
This paper introduces a parameter-free, data-driven regionalization method based on the minimum description length principle, capable of identifying natural spatial clusters without prior assumptions, and demonstrates its effectiveness on synthetic and real demographic data.
Contribution
The paper presents a novel, efficient, and parameter-free regionalization algorithm that uses data compression principles to identify natural spatial regions from data alone.
Findings
Successfully recovers spatial clusters in synthetic data
Coarse-grains real demographic data meaningfully
Reveals increased complexity in urban segregation over time
Abstract
Regionalization, spatially contiguous clustering, provides a means to reduce the effect of noise in sampled data and identify homogeneous areas for policy development among many other applications. Existing regionalization methods require user input such as the number of regions or a similarity measure between regions, which does not allow for the extraction of the natural regions defined solely by the data itself. Here we view the problem of regionalization as one of data compression and develop an efficient, parameter-free regionalization algorithm based on the minimum description length principle. We demonstrate that our method is capable of recovering planted spatial clusters in noisy synthetic data, and that it can meaningfully coarse-grain real demographic data. Using our description length formulation, we find that spatial ethnoracial data in U.S. metropolitan areas has become…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Mobility and Location-Based Analysis · Spatial and Panel Data Analysis · Land Use and Ecosystem Services
