# A Unified Framework to Prioritize RNA Virus Cross-Species Transmission Risk Across an Expansive Host Landscape

**Authors:** Di Zhao, Yi-Fei Wang, Zu-Fei Yin, Ya-Fei Wu, Hui-Jun Yu, Luo-Yuan Xia, Xiao-He Liu, Xiao-Ming Cui, Xiao-Yu Shi, Dai-Yun Zhu, Na Jia, Jia-Fu Jiang, Wu-Chun Cao, Wenqiang Shi

PMC · DOI: 10.3390/v18020211 · Viruses · 2026-02-05

## TL;DR

A new framework called UniVH improves the prediction of which animals RNA viruses can infect, helping identify potential zoonotic threats.

## Contribution

UniVH is a unified framework that improves host prediction accuracy for RNA viruses across a broad host range.

## Key findings

- UniVH achieved a 71.2% host prediction accuracy for novel viruses discovered after 2020.
- The model outperformed conventional BLASTp-based methods by 15.3%.
- Viral structural genes and host immune- and metabolism-related genes were most important for predictions.

## Abstract

RNA viruses exhibit high mutation rates and strong host adaptive capacity, posing major public health challenges. Although meta-transcriptomic studies have uncovered vast numbers of novel RNA viral sequences, identifying those with spillover risks remains difficult. Current virus host-prediction methods can only predict a narrow set of host labels at coarse taxonomic levels (e.g., kingdom or order), which hampers precise evaluation of cross-species transmission risk and may overlook potential zoonotic hosts. To overcome these limitations, we developed UniVH, a unified virus–host association prediction framework trained on an exceptionally broad spectrum of 90 viral families and 240 host families, enabling robust prediction even for phylogenetically distant or data-scarce hosts. UniVH achieved a host prediction accuracy of 71.2% for novel viruses discovered after 2020, representing a 15.3% improvement over conventional BLASTp-based homology approaches. Feature interpretation revealed that viral structural genes and host immune- and metabolism-related genes contributed most significantly to predictive performance. Model predictions indicated widespread host-range expansion, with 20 mammalian virus families doubling their documented mammalian host ranges and several showing marked increases in viruses with human-infection potential. This unified, interpretable framework represents an important methodological advance for future RNA virus spillover-risk evaluation and emerging virus prioritization.

## Full-text entities

- **Genes:** PPBPP1 (pro-platelet basic protein pseudogene 1) [NCBI Gene 728045] {aka PPBPL1, TGB2}, TICAM1 (TIR domain containing adaptor molecule 1) [NCBI Gene 148022] {aka IIAE6, MyD88-3, PRVTIRB, TICAM-1, TRIF}
- **Diseases:** infection (MESH:D007239), infectious disease (MESH:D003141), zoonotic diseases (MESH:D015047), injury to (MESH:D014947), inflammatory (MESH:D007249), Natural killer (MESH:D000077428), mediated cytotoxicity (MESH:C567355)
- **Chemicals:** KEGG (-)
- **Species:** Retroviridae (family) [taxon 11632], Homo sapiens (human, species) [taxon 9606], Orthomyxoviridae (family) [taxon 11308], Ebola virus [taxon 186536], Sus scrofa (pig, species) [taxon 9823], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12945150/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12945150/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/PMC12945150/full.md

---
Source: https://tomesphere.com/paper/PMC12945150