Crime in Philadelphia: Bayesian Clustering with Particle Optimization
Cecilia Balocchi, Sameer K. Deshpande, Edward I. George, Shane T., Jensen

TL;DR
This paper introduces a Bayesian clustering method with particle optimization to improve crime trend estimation in Philadelphia, accounting for spatial discontinuities and boundaries, leading to more accurate and calibrated forecasts.
Contribution
It presents a novel prior for Bayesian hierarchical models that partitions neighborhoods into clusters, and a new local search ensemble optimization for partition detection, enhancing crime trend analysis.
Findings
Effective in capturing spatial discontinuities in crime data.
Demonstrates improved estimation accuracy on simulated and real data.
Identifies meaningful neighborhood clusters in Philadelphia.
Abstract
Accurate estimation of the change in crime over time is a critical first step towards better understanding of public safety in large urban environments. Bayesian hierarchical modeling is a natural way to study spatial variation in urban crime dynamics at the neighborhood level, since it facilitates principled ``sharing of information'' between spatially adjacent neighborhoods. Typically, however, cities contain many physical and social boundaries that may manifest as spatial discontinuities in crime patterns. In this situation, standard prior choices often yield overly-smooth parameter estimates, which can ultimately produce mis-calibrated forecasts. To prevent potential over-smoothing, we introduce a prior that partitions the set of neighborhoods into several clusters and encourages spatial smoothness within each cluster. In terms of model implementation, conventional stochastic search…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData-Driven Disease Surveillance · Spatial and Panel Data Analysis · Data Analysis with R
