# Comparison of verbal autopsy using a large language model to biologically confirmed causes of death for malaria and other communicable diseases among children in six sub-Saharan African countries

**Authors:** Ronald Carshon-Marsh, Richard Wen, Thomas Kai Sze Ng, Rajeev Kamadod, Isaac Bogoch, Susan J. Bondy, Theodore J. Witek, Prabhat Jha

PMC · DOI: 10.1186/s12936-025-05774-z · Malaria Journal · 2026-01-06

## TL;DR

This study compares the accuracy of different verbal autopsy methods, including a large language model, in identifying malaria and other communicable diseases as causes of child deaths in sub-Saharan Africa.

## Contribution

The study introduces the use of a large language model (GPT-4o) for verbal autopsy and demonstrates its superior performance in identifying malaria-related deaths compared to existing models.

## Key findings

- GPT-4o correctly classified 46.2% of malaria deaths, outperforming InSilicoVA (30.0%) and InterVA-5 (23.1%).
- GPT-4o showed higher cause-specific mortality fraction accuracy (0.36) compared to InSilicoVA (0.07) and InterVA-5 (0.08).
- MITS confirmed that Sierra Leone had the highest proportion of post-neonatal malaria deaths at 30.3%.

## Abstract

Malaria, a preventable parasitic disease, causes most child deaths in sub-Saharan Africa (SSA). Reliable cause-of-death data are essential to evaluate progress toward the national and global malaria control goals. However, civil registration and vital statistics are often weak and incomplete in many low- and middle-income countries. In such circumstances, verbal autopsy (VA) provides an alternative means of mortality surveillance. In some settings, VA has been paired with Minimally Invasive Tissue Sampling (MITS) to obtain detailed biological confirmation of the causes of death. Here, we compare malaria-attributed and all-cause mortality among children younger than five years in six SSA countries, using three computer models (GPT-4o, InSilicoVA, and InterVA-5) to assign causes of death, against MITS as the reference standard.

We examined 3129 under-five deaths enrolled in six Child Health and Mortality Prevention Surveillance (CHAMPS) country sites in SSA between December 2016 and December 2022. Contrived free-text narrative summaries were generated for each record and coded into International Classification of Diseases (ICD-10) codes by GPT-4o. InSilicoVA and InterVA-5 outputs, provided in the World Health Organization 2016 VA codes, were harmonized to ICD-10 for comparison. The primary comparison was the underlying cause of death in VA models and MITS.

Sierra Leone had the highest proportion of post-neonatal deaths attributed to malaria at 30.3% (67/221), followed by Kenya at 17.3% (42/243), then Mozambique at 13% (18/138) and Mali at 5.5% (3/55) as defined by MITS. No malaria-attributable deaths were observed in neonates and stillbirths. GPT-4o correctly classified 60 (46.2%) of 130 malaria deaths, compared with 39 (30.0%) for InSilicoVA and 30 (23.1%) for InterVA-5. At the population level, the GPT-4o model achieved a higher cause-specific mortality fraction accuracy (0.36) compared to InSilicoVA (0.07) and InterVA-5 (0.08). GPT-4o performed comparatively better in attributing malaria, HIV/AIDS, and diarrhoeal diseases compared to other communicable diseases.

GPT-4o demonstrated superior performance over probabilistic VA models in identifying malaria-attributed deaths. National vital registration authorities and health ministries should consider integrating large language model-driven tools into their VA systems to enhance diagnostic precision. While less practicable at scale, focal and periodic MITS comparisons are useful for improving verbal autopsy systems. National mortality data are essential to track progress in reducing childhood deaths from malaria and other conditions.

The online version contains supplementary material available at 10.1186/s12936-025-05774-z.

## Linked entities

- **Diseases:** malaria (MONDO:0005136)

## Full-text entities

- **Diseases:** death (MESH:D003643), diarrhoeal diseases (MESH:D004194), Classification (MESH:D008310), Malaria (MESH:D008288), parasitic disease (MESH:D010272), stillbirths (MESH:D050497), HIV/AIDS (MESH:D015658), communicable diseases (MESH:D003141)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12870146/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12870146/full.md

## References

8 references — full list in the complete paper: https://tomesphere.com/paper/PMC12870146/full.md

---
Source: https://tomesphere.com/paper/PMC12870146