A Bayesian Spatial Model to Correct Under-Reporting in Urban Crowdsourcing
Gabriel Agostini, Emma Pierson, Nikhil Garg

TL;DR
This paper introduces a Bayesian spatial model to estimate true urban event occurrences from under-reported data, improving prediction accuracy and promoting equitable resource allocation across neighborhoods.
Contribution
It develops a Bayesian spatial latent variable model that accounts for under-reporting and spatial correlation, enhancing urban incident detection and equity in resource distribution.
Findings
Model predicts future reports more accurately than alternatives.
Allocations better reflect demographic disparities.
Reporting rates vary with neighborhood demographics.
Abstract
Decision-makers often observe the occurrence of events through a reporting process. City governments, for example, rely on resident reports to find and then resolve urban infrastructural problems such as fallen street trees, flooded basements, or rat infestations. Without additional assumptions, there is no way to distinguish events that occur but are not reported from events that truly did not occur--a fundamental problem in settings with positive-unlabeled data. Because disparities in reporting rates correlate with resident demographics, addressing incidents only on the basis of reports leads to systematic neglect in neighborhoods that are less likely to report events. We show how to overcome this challenge by leveraging the fact that events are spatially correlated. Our framework uses a Bayesian spatial latent variable model to infer event occurrence probabilities and applies it to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsData-Driven Disease Surveillance · Economic and Environmental Valuation · Spatial and Panel Data Analysis
Methodstravel james · Sparse Evolutionary Training
