# Inferring human behavior through online social networks may provide accurate behavioral estimates for outbreak forecasting of arboviruses

**Authors:** Frédéric Jourdain, Debapriyo Chakraborty, Beatrice Gaillard, Arnaud Gautier, Frédéric Simard, Pierre Jay Robert, Laurent Dormont, Jean-Claude Desenclos, Benjamin Roche

PMC · DOI: 10.1371/journal.pgph.0004889 · PLOS Global Public Health · 2025-07-24

## TL;DR

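Estimates of emotional states related to human-mosquito contact derived from messages on X can differ from large-scale survey estimates across age groups. After demographic adjustment, however, both sources yield comparable arbovirus outbreak predictions under certain conditions.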

## Contribution

The study provides the first evidence that inferring epidemiologically relevant behavior from social media data achieves results similar to those from large-scale opinion survey data.

## Key findings

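- Emotional state estimates for different age groups could differ substantially between X data and the opinion survey.
- Integrating demographic adjustments and incorporating variability into the transmission models produced comparable overall outbreak dynamics under certain conditions.
- X data could help forecast outbreak dynamics, with results similar to those from large-scale opinion surveys.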

## Abstract

Human behavior is known to be a fundamental, yet often neglected, component of infectious disease epidemiology, especially during outbreaks. To quantify its role and fluctuations, analyzing message contents on popular online social networks – part of so-called digital epidemiology – is a promising approach. However, such methods could be biased and generate estimation errors since social media users may not be representative of the general population. To address this, we systematically compared social media-derived estimates with those obtained from a large-scale opinion survey. In the setting of metropolitan France, where the risk of arbovirus outbreaks is increasingly important, we compared the frequency of three types of emotional states related to human-mosquito contact identified in 160,000 messages on X (formerly Twitter) with the frequency of the same emotional states expressed through a large-scale opinion survey involving 15,000 people during the same period. Both sources of data were used to parametrize a mathematical model of mosquito-borne virus transmission. We found that estimates of these emotional states for different age groups in the opinion survey could differ substantially from estimates based on X data. Nevertheless, by integrating demographic adjustments and incorporating variability into our transmission models, we showed that the predicted overall outbreak dynamics remain comparable under certain conditions. This study provides the first evidence that using digital social network data to infer epidemiologically relevant behavior achieves results similar to those obtained with large-scale opinion survey data. These outcomes highlight that X data could be used to help forecast outbreak dynamics, opening new opportunities for real-time assessment of human health-related behavior and the definition of control strategies.
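The abstract does not say how the demographic adjustment was carried out. As a rough illustration of why such an adjustment matters, the sketch below applies post-stratification by age group, a common reweighting technique that is assumed here and not confirmed as the authors' method. All age bands, rates, and shares are hypothetical values chosen for the example, and `poststratify` is an illustrative helper name.

```python
def poststratify(group_rates, shares):
    """Return the share-weighted average of per-group rates.

    group_rates: mapping of group label -> frequency of an emotional state
    shares:      mapping of group label -> proportion of that group (sums to 1)
    """
    if set(group_rates) != set(shares):
        raise ValueError("group labels must match")
    if abs(sum(shares.values()) - 1.0) > 1e-9:
        raise ValueError("shares must sum to 1")
    return sum(group_rates[g] * shares[g] for g in group_rates)


# Hypothetical numbers for illustration only (not taken from the paper).
x_rates = {"18-34": 0.40, "35-64": 0.20, "65+": 0.10}            # rate per age group on X
x_sample_shares = {"18-34": 0.60, "35-64": 0.30, "65+": 0.10}    # age mix of X users
population_shares = {"18-34": 0.25, "35-64": 0.50, "65+": 0.25}  # age mix of the population

# Unadjusted estimate: weighted by who is actually posting.
raw = poststratify(x_rates, x_sample_shares)

# Adjusted estimate: the same per-group rates, reweighted to the population.
adjusted = poststratify(x_rates, population_shares)

print(f"raw={raw:.3f} adjusted={adjusted:.3f}")
```

With these made-up inputs, the unadjusted figure is pulled toward the younger, over-represented group, while the reweighted figure reflects the population age structure. A value of this kind could then serve as a behavioral parameter in a transmission model.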

## Full-text entities

- **Diseases:** infectious disease (MESH:D003141)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12289027/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12289027/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/PMC12289027/full.md

---
Source: https://tomesphere.com/paper/PMC12289027