A Graph-Prediction-Based Approach for Debiasing Underreported Data
Hanyang Jiang, Yao Xie

TL;DR
This paper introduces a graph-based debiasing algorithm that jointly estimates event counts and discovery probabilities in underreported data, effectively addressing issues in fields like policing and pandemic data analysis without relying on strong priors.
Contribution
The paper proposes a novel graph-based algorithm for debiasing underreported data that jointly estimates counts and discovery probabilities without strong priors.
Findings
Effective in simulation experiments
Successfully applied to police 911 call data
Improves accuracy of underreporting correction
Abstract
We present a novel Graph-based debiasing Algorithm for Underreported Data (GRAUD) aiming at an efficient joint estimation of event counts and discovery probabilities across spatial or graphical structures. This innovative method provides a solution to problems seen in fields such as policing data and COVID- data analysis. Our approach avoids the need for strong priors typically associated with Bayesian frameworks. By leveraging the graph structures on unknown variables and , our method debiases the under-report data and estimates the discovery probability at the same time. We validate the effectiveness of our method through simulation experiments and illustrate its practicality in one real-world application: police 911 calls-to-service data.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData-Driven Disease Surveillance · Data Quality and Management · Data Management and Algorithms
