# Development and validation of HIV SMRTcap for the characterization of HIV-1 reservoirs across tissues and subtypes

**Authors:** Ghazal Sadri, Steven T. Nadakal, William Lauer, Justin Kos, Parmit K. Singh, Erin Elliott, Catherine W. Kaiser, Easton E. Ford, Nadia Richardson, Kaitlyn M. Shields, Elizabeth Hudson, Noemi L. Linden, Ali Danesh, James Powell, Peter Warburton, Juan Soto, Matthew Emery, Gintaras Deikus, Guinevere Q. Lee, Susanna L. Lamers, Steven J. Reynolds, Ronald Galiwango, Jessica L. Prodger, Stephen Tomusange, Taddeo Kityamuweesi, Tina Han, R. Brad Jones, Aaron A. R. Tobian, Alan N. Engelman, Robert Sebra, Susan Morgello, Andrew D. Redd, David Sachs, Eric Rouchka, Melissa L. Smith, Susan R. Ross, Jason M. Brenchley, Susan R. Ross, Jason M. Brenchley, Susan R. Ross, Jason M. Brenchley, Susan R. Ross, Jason M. Brenchley

PMC · DOI: 10.1371/journal.ppat.1013171 · 2026-01-13

## TL;DR

A new method called HIV SMRTcap is developed to better understand where HIV hides in the body, which could help in finding a cure.

## Contribution

HIV SMRTcap is a novel pipeline that simultaneously identifies HIV integration sites and proviral integrity using long-read sequencing.

## Key findings

- HIV SMRTcap enables comprehensive characterization of HIV-1 reservoirs across major global subtypes.
- The method works with both cell- and tissue-derived inputs, including samples from individuals on antiretroviral therapy.
- HIV SMRTcap provides high sensitivity for detecting HIV in various contexts and tissues.

## Abstract

Human Immunodeficiency Virus type 1 (HIV-1) is responsible for the global HIV/AIDS epidemic and the establishment of an integrated HIV-1 reservoir remains the primary obstacle to cure. Upon therapy interruption, reactivation of the persistent HIV-1 reservoir propagates viral rebound and mediates continued immunological decline. While furthering understanding of the HIV-1 reservoir is essential for HIV-1 cure, commonly used sequencing strategies are often limited by the reliance on short-read sequencing across separate assays to determine integration sites and proviral integrity – something that does not always adequately resolve complex human genomic repeats or low complexity regions. Simultaneous identification of proviral integration sites and proviral integrity at the single molecule level would enable HIV-1 reservoir characterization with minimal imputation or bioinformatic reconstruction. Here we present HIV Single Molecule Real Time Capture (HIV SMRTcap), a novel molecular and computational pipeline that directly and simultaneously identifies HIV-1 integration sites, defines proviral integrity, and characterizes clonal expansion of HIV-1 provirus-containing cells with single molecule resolution. In combination with long-read, single-molecule, real-time (SMRT) sequencing and custom analytic pipelines, HIV SMRTcap enables a highly comprehensive characterization of HIV-1 reservoirs. Moreover, we demonstrate here that HIV SMRTcap performs robustly across the major global subtypes (HIV-1 subtype A, B, C, D and A/D recombinant viruses), and can use both cell- and tissue-derived inputs, including samples from antiretroviral therapy (ART) treated individuals with undetectable viral loads. Our results demonstrate that HIV SMRTcap serves as a comprehensive, robust method for unbiased HIV-1 reservoir characterization. Used alone, or in combination with single-cell based methods, HIV SMRTcap will enable novel exploration of viral reservoirs across subtypes and in tissue-specific compartments, providing critical information needed to inform HIV-1 cure.

Human Immunodeficiency Virus (HIV) remains a persistent global and lifelong infection because the virus integrates into the genome of infected cells and is then protected by host mechanisms. Characterization of these virus-harboring cells (“reservoir”) has proven difficult due to a multitude of technical limitations, but remains critical as this viral reservoir reestablishes active infection upon therapy interruption. Understanding HIV integration patterns with regards to preferred genomic loci, prevalence of reservoir-containing cells within tissues, and profiling of internal deletions within the integrated viruses (proviruses) is pivotally important for vaccine and therapy design. In this work, we describe HIV Single Molecule Real Time Capture (HIV SMRTcap), an innovative pipeline that leverages long-read sequencing to simultaneously resolve proviral integration site(s) and integrity. This strategy enriches HIV-containing genomic DNA prior to single molecule sequencing and analyses using a custom informatic suite, ultimately capturing HIV at high sensitivity in contexts with and without antiretroviral therapy suppression, from primary tissue sources, and across HIV subtypes. We aim to describe HIV SMRTcap and its paired analysis tools in detail to enable wide adoption and encourage a pansubtype approach for equitable HIV research.

## Full-text entities

- **Genes:** gag-pol (Gag-Pol) [NCBI Gene 155348], ISG15 (ISG15 ubiquitin like modifier) [NCBI Gene 9636] {aka G1P2, IFI15, IMD38, IP17, UCRP, hUCRP}, vpu [NCBI Gene 155945], TAP2 (transporter 2, ATP binding cassette subfamily B member) [NCBI Gene 6891] {aka ABC18, ABCB3, APT2, D6S217E, MHC1D2, PSF-2}, Dnase1 (deoxyribonuclease I) [NCBI Gene 13419] {aka DNaseI, Dnl1}, CFLAR (CASP8 and FADD like apoptosis regulator) [NCBI Gene 8837] {aka CASH, CASP8AP1, CLARP, Casper, FLAME, FLAME-1}, TWIST1 (twist family bHLH transcription factor 1) [NCBI Gene 7291] {aka ACS3, BPES2, BPES3, CRS, CRS1, CSO}, nef [NCBI Gene 156110], CCR5 (C-C motif chemokine receptor 5) [NCBI Gene 1234] {aka CC-CKR-5, CCCKR5, CCR-5, CD195, CKR-5, CKR5}, vif (Vif) [NCBI Gene 155459], STAT5A (signal transducer and activator of transcription 5A) [NCBI Gene 6776] {aka MGF, STAT5}, CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, CCR3 (C-C motif chemokine receptor 3) [NCBI Gene 1232] {aka C C CKR3, CC-CKR-3, CD193, CKR 3, CKR3, CMKBR3}, CXCR4 (C-X-C motif chemokine receptor 4) [NCBI Gene 7852] {aka CD184, D2S201E, FB22, HM89, HSY3RR, LCR1}, BACH2 (BACH transcriptional regulator 2) [NCBI Gene 60468] {aka BTBD25, IMD60}, RNASE1 (ribonuclease A family member 1, pancreatic) [NCBI Gene 6035] {aka RAC1, RIB1, RNS1}, env [NCBI Gene 155971], MLC1 (modulator of VRAC current 1) [NCBI Gene 23209] {aka LVM, MLC, VL}, CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, vpr (Vpr) [NCBI Gene 155807], Cd4 (CD4 antigen) [NCBI Gene 12504] {aka L3T4, Ly-4}, SMARCE1 (SWI/SNF related BAF chromatin remodeling complex subunit E1) [NCBI Gene 6605] {aka BAF57, CSS5}, gag (Pr55(Gag)) [NCBI Gene 155030]
- **Diseases:** infected (MESH:D007239), inflammatory pericarditis (MESH:D010493), dilated cardiomyopathy (MESH:D002311), Allergy and Infectious Diseases (MESH:D003141), viremia (MESH:D014766), HIV (MESH:D015658), myocarditis (MESH:D009205), HIV/AIDS (MESH:D016263), HIV-1 infection (MESH:D015490), congestive heart failure (MESH:D006333), death (MESH:D003643), acute lymphoblastic leukemia (MESH:D054198), dislocation (MESH:D004204), chronic inflammation (MESH:D007249)
- **Chemicals:** nitrogen (MESH:D009584), NaOH (MESH:D012972), AMPure (-), L-Glutamine (MESH:D005973), basiliximab (MESH:D000077552), Alemtuzumab (MESH:D000074323), penicillin (MESH:D010406), streptomycin (MESH:D013307), DMSO (MESH:D004121), PB (MESH:D007854), CO2 (MESH:D002245), aluminum (MESH:D000535), HEPES (MESH:D006531), water (MESH:D014867), Nucleotide (MESH:D009711)
- **Species:** Human immunodeficiency virus 1 (no rank) [taxon 11676], HIV-1 group M (no rank) [taxon 388795], Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** CEM — Homo sapiens (Human), Childhood T acute lymphoblastic leukemia, Cancer cell line (CVCL_0207), 8e5 — Opodiphthera eucalypti (Emperor gum moth), Spontaneously immortalized cell line (CVCL_Z109), SupT1 — Homo sapiens (Human), Childhood T lymphoblastic lymphoma, Cancer cell line (CVCL_1714)

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12851485/full.md

---
Source: https://tomesphere.com/paper/PMC12851485