# A malaria seasonality dataset for sub-Saharan Africa

**Authors:** Francesca Sanna, Suzanne H. Keddie, Tara Boyhan, Paulina A. Dzianach, Michael McPhail, Julia Seitz, Thomas Nguyen, Adrian Redpath, Twatasha Chikolwa, Annie J. Browne, Jailos Lubinda, Adam Saddler, Sarah Hafsia, Rubi Jayaseelen, Hunter S. Baggen, Jennifer A. Rozier, Tasmin L. Symons, Joseph Harris, Sarah Connor, Camilo Vargas, Charles Whittaker, Michele Nguyen, Peter W. Gething, Daniel J. Weiss

PMC · DOI: 10.1038/s41597-025-05996-5 · Scientific Data · 2025-10-28

## TL;DR

This paper presents a publicly available dataset of malaria seasonality in sub-Saharan Africa, combining historical data from literature and surveillance to support targeted interventions.

## Contribution

The novel contribution is a comprehensive, geolocated dataset of malaria seasonality in sub-Saharan Africa, including both quantitative and qualitative data.

## Key findings

- The dataset includes malaria prevalence, incidence, mortality, and entomological timeseries from 2000 onwards.
- A novel natural language processing method was used to accelerate literature screening for seasonality descriptions.
- The dataset integrates data from literature, routine surveillance, and entomological sources for a holistic view.

## Abstract

Malaria imposes a significant global health burden and remains a major cause of child mortality in sub-Saharan Africa. In many countries, malaria transmission varies seasonally. The use of seasonally-deployed interventions is expanding, and the effectiveness of these control measures hinges on quantitative and geographically-specific characterisations of malaria seasonality. Malariometric timeseries from routine surveillance data and scientific and programmatic literature offer a resource for modelling patterns of malaria seasonality. This study creates and makes publicly available a geolocated dataset of historical timeseries describing malaria seasonality published since 2000 for sub-Saharan Africa. We used three approaches to assemble the dataset: i) an extensive literature review that included novel natural language processing to accelerate screening of published articles, ii) extractions from a routine surveillance dataset that contains geolocated data from all malaria-endemic countries, and iii) cross-referencing and incorporation of timeseries from a key entomological dataset. The resulting data include malaria prevalence, incidence, mortality, and entomological timeseries; and a novel assembly of qualitative descriptions of malaria seasonality extracted from published literature.

## Linked entities

- **Diseases:** malaria (MONDO:0005136)

## Full-text entities

- **Diseases:** Malaria (MESH:D008288)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12569248/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12569248/full.md

## References

9 references — full list in the complete paper: https://tomesphere.com/paper/PMC12569248/full.md

---
Source: https://tomesphere.com/paper/PMC12569248