# Quantifying prevalence and risk factors of HIV multiple infection in Uganda from population-based deep-sequence data

**Authors:** Michael A. Martin, Andrea Brizzi, Xiaoyue Xi, Ronald Moses Galiwango, Sikhulile Moyo, Deogratius Ssemwanga, Alexandra Blenkinsop, Andrew D. Redd, Lucie Abeler-Dörner, Christophe Fraser, Steven J. Reynolds, Thomas C. Quinn, Joseph Kagaayi, David Bonsall, David Serwadda, Gertrude Nakigozi, Godfrey Kigozi, M. Kate Grabowski, Oliver Ratmann, Richard A. Koup, Penny L. Moore, Richard A. Koup, Penny L. Moore, Richard A. Koup, Penny L. Moore

PMC · DOI: 10.1371/journal.ppat.1013065 · PLOS Pathogens · 2025-04-22

## TL;DR

This study estimates that about 4% of people with HIV in Uganda have multiple infections, which can lead to new HIV strains and higher transmission risks.

## Contribution

The study introduces a novel Bayesian deep-phylogenetic model to accurately estimate multiple HIV infections from deep-sequence data.

## Key findings

- Approximately 4.09% of viremic HIV participants in Uganda had multiple infections.
- Individuals in high-HIV-prevalence communities were 2.33 times more likely to have multiple infections.
- The new model accounts for sequencing biases and improves the accuracy of multiple infection detection.

## Abstract

People living with HIV can acquire secondary infections through a process called superinfection, giving rise to simultaneous infection with genetically distinct variants (multiple infection). Multiple infection provides the necessary conditions for the generation of novel recombinant forms of HIV and may worsen clinical outcomes and increase the rate of transmission to HIV seronegative sexual partners. To date, studies of HIV multiple infection have relied on insensitive bulk-sequencing, labor intensive single genome amplification protocols, or deep-sequencing of short genome regions. Here, we identified multiple infections in whole-genome or near whole-genome HIV RNA deep-sequence data generated from plasma samples of 2,029 people living with viremic HIV who participated in the population-based Rakai Community Cohort Study (RCCS). We estimated individual- and population-level probabilities of being multiply infected and assessed epidemiological risk factors using the novel Bayesian deep-phylogenetic multiple infection model (deep − phyloMI) which accounts for bias due to partial sequencing success and false-negative and false-positive detection rates. We estimated that between 2010 and 2020, 4.09% (95% highest posterior density interval (HPD) 2.95%–5.45%) of RCCS participants with viremic HIV multiple infection at time of sampling. Participants living in high-HIV prevalence communities along Lake Victoria were 2.33-fold (95% HPD 1.3–3.7) more likely to harbor a multiple infection compared to individuals in lower prevalence neighboring communities. This work introduces a high-throughput surveillance framework for identifying people with multiple HIV infections and quantifying population-level prevalence and risk factors of multiple infection for clinical and epidemiological investigations.

HIV exists as a population of genetically distinct viral variants among people living with HIV. People living with HIV can be infected with genetically distinct variants. Identification of these mixed infections requires generating viral genomic data from people living with HIV. In the past, the approaches used to identify multiple infections from viral genomic data have had poor sensitivity or required labor intensive protocols that are prohibitive in application to large data sets. Prior work has also only utilized data generated from small portions of the viral genome and the statistical procedures used to generate population-level estimates from sequencing data generated from individual infections has not accounted for incomplete sampling of the within-host viral population or sources of sequencing error, which may confound multiple infection estimates. Here, we develop a statistical model that addresses these limitations and allows for the identification of multiple infections and the estimation of population-level risk of multiple infection from deep-sequence data. We fit this model to population-based HIV genomic data from people living with HIV in southern Uganda and estimate that approximately 4% of viremic participants harbor a multiple infection at a given point in time. We show that the prevalence of multiple infections is higher in key populations with high HIV prevalence. These findings inform our understanding of the sexual risk networks that give rise to multiple infections and aid in efforts to model HIV epidemiological dynamics and evolution during a period of incidence declines and shifting transmission dynamics across Eastern and Southern Africa.

## Full-text entities

- **Diseases:** infected (MESH:D007239), HIV (MESH:D015658)
- **Species:** Homo sapiens (human, species) [taxon 9606], Human immunodeficiency virus 1 (no rank) [taxon 11676]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12055032/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12055032/full.md

## References

122 references — full list in the complete paper: https://tomesphere.com/paper/PMC12055032/full.md

---
Source: https://tomesphere.com/paper/PMC12055032