# Interpreting epidemiological surveillance data: a modelling study based on Pune city

**Authors:** Prathith Bhargav, Soumil Kelkar, Joy Merwin Monteiro, Philip Cherian

PMC · DOI: 10.1038/s41598-025-30023-3 · 2025-11-29

## TL;DR

This study uses simulations to assess how well real-world surveillance data reflects the true state of an epidemic in Pune city.

## Contribution

The novelty lies in using agent-based simulations to evaluate the representativeness of surveillance data for decision-making during epidemics.

## Key findings

- Simulations show how testing and contact tracing strategies influence surveillance data accuracy.
- The study highlights discrepancies between observed and actual epidemic trends due to data generation processes.
- Findings suggest implications for using surveillance data in public health decisions during outbreaks.

## Abstract

Routine epidemiological surveillance data represents one of the most continuous and current sources of data during the course of an epidemic. This data is used to calibrate epidemiological forecasting models, as well as for public health decision making such as the imposition and lifting of lockdowns and quarantine measures. However, such data is generated during testing and contact tracing and not through randomized sampling. Furthermore, since the process of generating this data affects the epidemic trajectory itself – identification of infected persons might lead to them being quarantined, for instance – it is unclear how representative such data is of the actual epidemic itself. For example, will the observed rise in infections correspond well with the actual rise in infections? To answer such questions, we employ epidemiological simulations not to study the effectiveness of different public health strategies in controlling the spread of the epidemic, but to study the quality of the resulting surveillance data and derived metrics and their utility for decision making. Using the BharatSim simulation framework, we build an agent-based epidemiological model with a detailed representation of testing and contact tracing strategies based on those employed in Pune city during the COVID-19 pandemic to generate synthetic surveillance data. Infected persons are identified, quarantined and/or hospitalised based on these strategies. We perform extensive simulations to study the impact of different public health strategies and the availability of tests and contact tracing efficiencies on the resulting surveillance data as well as on the course of the epidemic. The fidelity of the resulting surveillance data in representing the real-time state of the epidemic and in decision-making is explored in the context of Pune city.

## Linked entities

- **Diseases:** COVID-19 (MONDO:0100096)

## Full-text entities

- **Diseases:** Infected (MESH:D007239), dead (MESH:D001926), death (MESH:D003643), COVID-19 (MESH:D000086382), flu (MESH:D007251)
- **Chemicals:** DT (MESH:D013936)
- **Species:** Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12775538/full.md

---
Source: https://tomesphere.com/paper/PMC12775538