Scalable model selection for spatial additive mixed modeling: application to crime analysis
Daisuke Murakami, Mami Kajita, Seiji Kajita

TL;DR
This paper introduces a fast, scalable method for selecting spatial regression models suitable for large datasets, demonstrated through crime analysis in Japan, improving both accuracy and computational efficiency.
Contribution
It develops a novel dimension reduction-based model selection approach for large-scale spatial regression, implemented in the R package spmoran.
Findings
Accurately selects models in large datasets
Significantly speeds up computation
Effectively predicts crime risk factors
Abstract
A rapid growth in spatial open datasets has led to a huge demand for regression approaches accommodating spatial and non-spatial effects in big data. Regression model selection is particularly important to stably estimate flexible regression models. However, conventional methods can be slow for large samples. Hence, we develop a fast and practical model-selection approach for spatial regression models, focusing on the selection of coefficient types that include constant, spatially varying, and non-spatially varying coefficients. A pre-processing approach, which replaces data matrices with small inner products through dimension reduction dramatically accelerates the computation speed of model selection. Numerical experiments show that our approach selects the model accurately and computationally efficiently, highlighting the importance of model selection in the spatial regression…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpatial and Panel Data Analysis · Land Use and Ecosystem Services · Economic and Environmental Valuation
