# Predicting viral sensitivity to antibodies using genetic sequences and antibody similarities

**Authors:** Kai S. Shimagaki, Gargi Kher, Rebecca M. Lynch, John P. Barton, Amber M Smith, Adam Ewing, Amber M Smith, Adam Ewing, Amber M Smith, Adam Ewing

PMC · DOI: 10.1371/journal.pcbi.1014095 · PLOS Computational Biology · 2026-03-23

## TL;DR

This paper introduces a method called GNL to predict how sensitive viruses are to antibodies using genetic data and antibody similarities, which can help in designing better antibody therapies.

## Contribution

The novel contribution is the development of GNL, an interpretable method that leverages genetic sequences and antibody similarities to predict viral sensitivity.

## Key findings

- GNL performs favorably compared to state-of-the-art methods, especially when data is limited.
- The model identifies key viral mutations that influence antibody sensitivity and aligns with known resistance mechanisms.
- GNL can predict sensitivity for viral sequences without any measured neutralization data.

## Abstract

For genetically variable pathogens such as human immunodeficiency virus (HIV)-1, individual viral isolates can differ dramatically in their sensitivity to antibodies. The ability to predict which viruses will be sensitive and which will be resistant to a specific antibody could aid in the design of antibody therapies and help illuminate resistance evolution. Due to the enormous number of possible combinations, it is not possible to experimentally measure neutralization values for all pairs of viruses and antibodies. Here, we developed a simple and interpretable method called grouped neutralization learning (GNL) to predict neutralization values by leveraging viral genetic sequences and similarities in neutralization profiles between antibodies. The trained model is interpretable and can identify key mutations that impact viral sensitivity. Our method compares favorably to state-of-the-art approaches and is robust to model parameter assumptions. GNL can predict neutralization values for viral sequences without observed neutralization measurements, an important capability for assessing antibody coverage in populations whose viral diversity is genetically characterized. We also demonstrate that GNL can successfully transfer knowledge between independent data sets, allowing rapid estimates of viral sensitivity based on prior knowledge.

Antibodies play a central role in controlling viral infections, yet viral strains can differ widely in their sensitivity to a given antibody. Quantifying this sensitivity is critical for understanding immune escape, interpreting experimental and clinical studies, and designing effective antibody-based therapies. However, virus neutralization assays are labor-intensive and inherently sparse: only a small fraction of the vast space of possible virus-antibody combinations can be measured experimentally. Here, we present a computational framework, grouped neutralization learning (GNL), for predicting viral sensitivity across large virus-antibody panels using incomplete neutralization data and viral genetic sequences. The central idea of GNL is to share information across antibodies with similar neutralization profiles, allowing sparsely measured antibodies to benefit from related, more extensively characterized ones. Building on prior insights that neutralization data exhibit strong low-rank structure, we integrate antibody-level information sharing with sequence-based prediction and low-rank matrix refinement to produce robust and accurate neutralization predictions. Across large HIV-1 neutralization datasets, GNL performs favorably compared to state-of-the-art methods, particularly in when data is limited. Our model is interpretable, identifying specific viral mutations that strongly influence antibody sensitivity and recovering patterns consistent with known resistance mechanisms. Importantly, GNL can generate sensitivity estimates even for viral sequences without any measured neutralization data. Collectively, this work provides a scalable and interpretable framework for mapping virus-antibody interactions from partial data. While developed and validated using HIV-1, our approach is broadly applicable to other rapidly evolving pathogens for which genetic sequences and sparse neutralization measurements are available.

## Full-text entities

- **Genes:** RGN (regucalcin) [NCBI Gene 9104] {aka GNL, HEL-S-41, RC, SMP30}, Env [NCBI Gene 155971], CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}
- **Diseases:** Ewing (MESH:D012512), viral infections (MESH:D014777), infection (MESH:D007239), HIV-1 infected (MESH:D015658)
- **Chemicals:** glycan (MESH:D011134), N (MESH:D009584), IIP (-), Asn (MESH:D001216), CH103 (MESH:C014893)
- **Species:** Homo sapiens (human, species) [taxon 9606], Human immunodeficiency virus 1 (no rank) [taxon 11676]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13020759/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13020759/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/PMC13020759/full.md

---
Source: https://tomesphere.com/paper/PMC13020759