Identifying Selections Operating on HIV-1 Reverse Transcriptase via Uniform Manifold Approximation and Projection
Shefali Qamar, Manel Camps, Jay Kim

TL;DR
This study uses UMAP and clustering to analyze HIV-1 reverse transcriptase sequences, revealing distinct evolutionary groups associated with treatment regimens, aiding understanding of drug resistance evolution.
Contribution
The paper introduces a novel application of UMAP combined with clustering to identify evolutionary sectors in HIV RT sequences related to drug resistance.
Findings
Identified 21 distinct sequence clusters.
Clusters correlate with treatment regimens.
Mutation signatures suggest higher-order epistasis.
Abstract
We analyze 14,651 HIV1 reverse transcriptase (HIV RT) sequences from the Stanford HIV Drug Resistance Database labeled with treatment regimen in order to study the evolution this enzyme under drug selection in the clinic. Our goal is to identify distinct sectors of HIV RT's sequence space that are undergoing evolution as a way to identify individual selections and/or evolutionary solutions. We utilize Uniform Manifold Approximation and Projection (UMAP), a graph-based dimensionality reduction technique uniquely suited for the detection of non-linear dependencies and visualize the results using an unsupervised clustering algorithm based on density analysis. Our analysis produced 21 distinct clusters of sequences. Supporting the biological significance of these clusters, they tend to represent phylogenetically related sequences with strong correspondence to distinct treatment regimens.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHIV Research and Treatment · HIV/AIDS drug development and treatment · Bioinformatics and Genomic Networks
