# A simple phylogenetic approach to analyze hypermutated HIV proviruses reveals insights into their dynamics and persistence during antiretroviral therapy

**Authors:** Aniqa Shahid, Bradley R. Jones, Maggie C. Duncan, Signe MacLennan, Michael J. Dapp, Mark H. Kuniholm, Bradley Aouizerat, Nancie M. Archin, Stephen Gange, Igho Ofotokun, Margaret A. Fischl, Seble Kassaye, Harris Goldstein, Kathryn Anastos, Jeffrey B. Joy, Zabrina L. Brumme

PMC · DOI: 10.21203/rs.3.rs-4549934/v1 · 2024-06-13

## TL;DR

A new phylogenetic method helps understand how hypermutated HIV viruses persist during treatment, revealing they can last for decades and are seeded early after infection.

## Contribution

A novel phylogenetic approach to analyze hypermutated HIV proviruses, enabling accurate molecular dating and insights into their persistence during ART.

## Key findings

- Hypermutated proviruses span a wide age range, with some dating back to shortly after HIV infection.
- Corrected phylogenetic trees produce integration date estimates consistent with benchmark trees excluding hypermutated sequences.
- In some individuals, hypermutated proviruses differ in age distribution from env-intact ones, suggesting heterogeneous decay rates.

## Abstract

Hypermutated proviruses, which arise in a single HIV replication cycle when host antiviral APOBEC3 proteins introduce extensive G-to-A mutations throughout the viral genome, persist in all people living with HIV receiving antiretroviral therapy (ART). But, the within-host evolutionary origins of hypermutated sequences are incompletely understood because phylogenetic inference algorithms, which assume that mutations gradually accumulate over generations, incorrectly reconstruct their ancestor-descendant relationships. Using > 1400 longitudinal single-genome-amplified HIV env-gp120 sequences isolated from six women over a median 18 years of follow-up − including plasma HIV RNA sequences collected over a median 9 years between seroconversion and ART initiation, and > 500 proviruses isolated over a median 9 years on ART − we evaluated three approaches for removing hypermutation from nucleotide alignments. Our goals were to 1) reconstruct accurate phylogenies that can be used for molecular dating and 2) phylogenetically infer the integration dates of hypermutated proviruses persisting during ART. Two of the tested approaches (stripping all positions containing putative APOBEC3 mutations from the alignment, or replacing individual putative APOBEC3 mutations in hypermutated sequences with the ambiguous base R) consistently normalized tree topologies, eliminated erroneous clustering of hypermutated proviruses, and brought env-intact and hypermutated proviruses into comparable ranges with respect to multiple tree-based metrics. Importantly, these corrected trees produced integration date estimates for env-intact proviruses that were highly concordant with those from benchmark trees that excluded hypermutated sequences, indicating that the corrected trees can be used for molecular dating. Use of these trees to infer the integration dates of hypermutated proviruses persisting during ART revealed that these spanned a wide age range, with the oldest ones dating to shortly after infection. This indicates that hypermutated proviruses, like other provirus types, begin to be seeded into the proviral pool immediately following infection, and can persist for decades. In two of the six participants, hypermutated proviruses differed from env-intact ones in terms of their age distributions, suggesting that different provirus types decay at heterogeneous rates in some hosts. These simple approaches to reconstruct hypermutated provirus’ evolutionary histories, allow insights into their in vivo origins and longevity, towards a more comprehensive understanding of HIV persistence during ART.

## Linked entities

- **Proteins:** Apobec3 (apolipoprotein B mRNA editing enzyme, catalytic polypeptide 3)

## Full-text entities

- **Genes:** ITIH4 (inter-alpha-trypsin inhibitor heavy chain 4) [NCBI Gene 3700] {aka GP120, H4P, IHRP, ITI-HC4, ITIHL1, PK-120}
- **Diseases:** infection (MESH:D007239), HIV (MESH:D015658)
- **Species:** Human immunodeficiency virus 1 (no rank) [taxon 11676], Homo sapiens (human, species) [taxon 9606]

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11213167/full.md

---
Source: https://tomesphere.com/paper/PMC11213167