# Protein embeddings reveal a continuous molecular landscape of host adaptation in waterfowl parvoviruses

**Authors:** Nihui Shao, Yunfei Guo

PMC · DOI: 10.3389/fbinf.2025.1738737 · Frontiers in Bioinformatics · 2026-01-27

## TL;DR

This study shows how waterfowl parvoviruses adapt to different hosts through gradual molecular changes revealed by AI and biophysics.

## Contribution

The study introduces a framework linking AI-based protein embeddings to biophysical mechanisms of viral host adaptation.

## Key findings

- Protein embeddings reveal a continuous molecular landscape between two parvovirus types.
- A flexible surface loop is identified as a biophysical pivot in host adaptation.
- A GPV-type isolate efficiently replicates in ducklings, showing cross-host infectivity.

## Abstract

Viral adaptation across closely related hosts often proceeds through subtle molecular changes that escape detection by classical phylogenetic analyses. In waterfowl parvoviruses, we integrate AI-based protein language modeling, structural biophysics, and infection assays to reveal a continuous trajectory of host adaptation linking Goose parvovirus (GPV) and Muscovy duck parvovirus (MDPV). Protein embeddings of VP1 sequences reveal a smooth manifold bridging GPV and MDPV, which softens an apparent phylogenetic dichotomy into a graded molecular topology. Structural modeling identifies a flexible surface loop (residues 300–420) as a biophysical pivot. Along the embedding trajectory, this loop undergoes gradual conformational expansion and electrostatic neutralization, quantitatively linking embedding coordinates to capsid surface remodeling. Experimentally, a GPV-type isolate recovered from naturally diseased ducks replicated efficiently in duck embryos, duck embryo fibroblasts, and live ducklings, producing characteristic lesions. These results show that waterfowl parvoviruses evolve along a continuous molecular–electrostatic landscape in which cumulative structural adjustments enable cross-host infectivity. Our framework connects AI-derived molecular representations to biophysical mechanisms and biological function, supporting a model of viral host adaptation as a predominantly continuous process and providing a foundation for predicting cross-host potential in emerging viral systems.

## Linked entities

- **Proteins:** VP1 (pyrophosphate-energized vacuolar membrane proton pump 1)

## Full-text entities

- **Genes:** VP1 [NCBI Gene 1403423]
- **Diseases:** infection (MESH:D007239)
- **Species:** Goose parvovirus (no rank) [taxon 38251], Muscovy duck parvovirus (no rank) [taxon 37325]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12887590/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12887590/full.md

## References

48 references — full list in the complete paper: https://tomesphere.com/paper/PMC12887590/full.md

---
Source: https://tomesphere.com/paper/PMC12887590