A latent shared-component generative model for real-time disease surveillance using Twitter data
Roberto C.S.N.P. Souza, Denise E.F de Brito, Renato M. Assun\c{c}\~ao,, Wagner Meira Jr

TL;DR
This paper introduces a generative model linking Twitter data and dengue case fluctuations, enabling real-time epidemic monitoring in small regions, demonstrated with Brazilian town data.
Contribution
The paper presents a novel shared-component generative model that jointly analyzes Twitter posts and disease case data for timely epidemic surveillance.
Findings
Model accurately predicts future dengue cases using Twitter and case data.
Joint modeling improves epidemic trend detection.
Effective for small geographical areas.
Abstract
Exploiting the large amount of available data for addressing relevant social problems has been one of the key challenges in data mining. Such efforts have been recently named "data science for social good" and attracted the attention of several researchers and institutions. We give a contribution in this objective in this paper considering a difficult public health problem, the timely monitoring of dengue epidemics in small geographical areas. We develop a generative simple yet effective model to connect the fluctuations of disease cases and disease-related Twitter posts. We considered a hidden Markov process driving both, the fluctuations in dengue reported cases and the tweets issued in each region. We add a stable but random source of tweets to represent the posts when no disease cases are recorded. The model is learned through a Markov chain Monte Carlo algorithm that produces the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData-Driven Disease Surveillance · Complex Network Analysis Techniques · Human Mobility and Location-Based Analysis
