# Machine Learning Used in Communicable Disease Control: A Scoping Review

**Authors:** Sharon Birdi, Atushi Patel, Roxana Rabet, Navreet Singh, Steve Durant, Tina Vosoughi, Faris Kapra, Mahek Shergill, Elnathan Mesfin, Carolyn Ziegler, Shehzad Ali, David Buckeridge, Marzyeh Ghassemi, Jennifer Gibson, Ava John-Baptiste, Jillian Macklin, Melissa Mccradden, Kwame Mckenzie, Sharmistha Mishra, Parisa Naraei, Akwasi Owusu-Bempah, Laura Rosella, James Shaw, Ross Upshur, Andrew D. Pinto

PMC · DOI: 10.3389/phrs.2026.1608074 · Public Health Reviews · 2026-02-13

## TL;DR

This paper reviews how machine learning is used to control communicable diseases and highlights the need to address algorithmic biases in these models.

## Contribution

The study systematically identifies ML applications in communicable disease control and emphasizes the under-addressed issue of algorithmic bias.

## Key findings

- Most studies focused on SARS-CoV-2, with fewer on malaria, HIV, and tuberculosis.
- Only a small percentage of studies addressed or mitigated algorithmic bias in their models.
- ML is increasingly used for disease surveillance, but bias mitigation remains a priority for equitable outcomes.

## Abstract

Communicable diseases continue to threaten global health, with COVID-19 as a recent example. Rapid data analysis using machine learning (ML) is crucial for detecting and controlling outbreaks. We aimed to identify how ML approaches have been applied to achieve public health objectives in communicable disease control and to explore algorithmic biases in model design, training, and implementation, and strategies to mitigate these biases.

We searched MEDLINE, Embase, Cochrane Central, Scopus, ACM DL, INSPEC, and Web of Science to identify peer-reviewed studies from 1 January 2000, to 15 July 2022. Included studies applied ML models in population and public health to address ten communicable diseases with high prevalence.

28,378 citations were retrieved, and 209 met our inclusion criteria. ML for communicable diseases has risen since 2020, particularly for SARS-CoV-2 (n = 177), followed by malaria, HIV, and tuberculosis. Eighteen studies (8.61%) considered bias, and only eleven implemented mitigation strategies.

A growing number of studies used ML for disease surveillance. Addressing biases in model design should be prioritized in future research to improve reliability and equity in public health outcomes.

## Linked entities

- **Diseases:** SARS-CoV-2 (MONDO:0100096), malaria (MONDO:0005136), tuberculosis (MONDO:0018076)

## Full-text entities

- **Diseases:** infection (MESH:D007239), COVID (MESH:D000086382), diarrheal diseases (MESH:D004403), death (MESH:D003643), HIV (MESH:D015658), malaria (MESH:D008288), Communicable Disease (MESH:D003141), pertussis (MESH:D014917), measles (MESH:D008457), hepatitis (MESH:D056486), tuberculosis (MESH:D014376), meningitis (MESH:D008580), respiratory infections (MESH:D012141), ML (MESH:D007859)
- **Species:** Human immunodeficiency virus 1 (no rank) [taxon 11676], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12945845/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12945845/full.md

## References

224 references — full list in the complete paper: https://tomesphere.com/paper/PMC12945845/full.md

---
Source: https://tomesphere.com/paper/PMC12945845