# A Prediction Model for Uncoating Receptor Usage in Human Enteroviruses Based on Amino Acid Sequences and a Naive Bayes Algorithm

**Authors:** Yongtao Jia, Zhenyu Xie, Guoying Zhu, Changzheng Dong

PMC · DOI: 10.3390/v18020236 · Viruses · 2026-02-13

## TL;DR

This paper develops a model to predict which receptors human enteroviruses use to infect cells, based on amino acid sequences and a machine learning algorithm.

## Contribution

A novel prediction model using amino acid sequences and Naive Bayes algorithm to identify uncoating receptors in human enteroviruses.

## Key findings

- The model achieved 100% prediction accuracy on training and validation datasets.
- Most enterovirus serotypes use seven known receptors like SCARB2, CAR, and ICAM-1.
- Some serotypes may use a novel, unknown receptor.

## Abstract

This study constructed a bioinformatics prediction algorithm for human enterovirus uncoating receptors based on amino acid sequences and physicochemical properties. Based on the availability of uncoating receptor information and three-dimensional (3D) structural data, human enterovirus serotypes were classified into training, validation, and prediction datasets. Using amino acid sequences of receptor-binding sites and their physicochemical properties as model features, a prediction model was constructed using the Naive Bayes algorithm and bioinformatic network analysis method. The results showed that both the training and validation datasets achieved a prediction accuracy of 100%. Among the 56 serotypes in the prediction dataset, the vast majority utilized seven known types of uncoating receptors (e.g., SCARB2, CAR, and ICAM-1), while a minority of serotypes may share the same novel, unknown receptor. This study indicates that uncoating receptors can be accurately predicted based on the amino acid sequences and physicochemical properties of human enteroviruses. Furthermore, the three-dimensional structural features at receptor-binding sites can be reflected through corresponding amino acid sequences and their physicochemical properties. This study facilitates a more in-depth investigations of enterovirus pathogenic mechanisms and provides important insights for the development of vaccines and antiviral drugs.

## Linked entities

- **Proteins:** SCARB2 (scavenger receptor class B member 2), CASR (calcium sensing receptor), ICAM1 (intercellular adhesion molecule 1)

## Full-text entities

- **Genes:** ICAM5 (intercellular adhesion molecule 5) [NCBI Gene 7087] {aka TLCN, TLN}, CXADR (CXADR cell adhesion molecule) [NCBI Gene 1525] {aka CAR, CAR4/6, HCAR}, CXADRP1 (CXADR pseudogene 1) [NCBI Gene 653108] {aka CAR, CXADRP}, ICAM1 (intercellular adhesion molecule 1) [NCBI Gene 3383] {aka BB2, CD54, P3.58}, ANXA2 (annexin A2) [NCBI Gene 302] {aka ANX2, ANX2L4, CAL1H, HEL-S-270, LIP2, LPC2}, KREMEN1 (kringle containing transmembrane protein 1) [NCBI Gene 83999] {aka ECTD13, KREMEN, KRM1}, CD209 (CD209 molecule) [NCBI Gene 30835] {aka CDSIGN, CLEC4L, DC-SIGN, DC-SIGN1, hDC-SIGN}, SCARB2 (scavenger receptor class B member 2) [NCBI Gene 950] {aka AMRF, CD36L2, EPM4, HLGP85, LGP85, LIMP-2}, PVR (PVR cell adhesion molecule) [NCBI Gene 5817] {aka CD155, HVED, NECL5, Necl-5, PVS, TAGE4}, MPZL2 (myelin protein zero like 2) [NCBI Gene 10205] {aka DFNB111, EVA, EVA1}, EVC (EvC ciliary complex subunit 1) [NCBI Gene 2121] {aka DWF-1, EVC1, EVCL}, CD55 (CD55 molecule (Cromer blood group)) [NCBI Gene 1604] {aka CHAPLE, CR, CROM, DAF, TC}, SELPLG (selectin P ligand) [NCBI Gene 6404] {aka CD162, CLA, PSGL-1, PSGL1}, FCGRT (Fc gamma receptor and transporter) [NCBI Gene 2217] {aka FCRN, FcgammaRn, alpha-chain}
- **Diseases:** myocarditis (MESH:D009205), pneumonia (MESH:D011014), HFMD (MESH:D006232), injury to (MESH:D014947), EVB (MESH:D004769), acute flaccid myelitis (MESH:C000629404), poliomyelitis (MESH:D011051), viral encephalitis (MESH:D018792), bronchitis (MESH:D001991)
- **Chemicals:** SGAG (MESH:C013786), CVA (MESH:C034482), sialic acid (MESH:D019158), lipid (MESH:D008055), CVB (-), HS (MESH:D006497), amino acids (MESH:D000596)
- **Species:** enterovirus D68 (no rank) [taxon 42789], Coxsackievirus A24 (no rank) [taxon 12089], Enterovirus B (no rank) [taxon 138949], Coxsackievirus B3 (no rank) [taxon 12072], Enterovirus A71 (no rank) [taxon 39054], Coxsackievirus B1 (no rank) [taxon 12071], Enterovirus A (no rank) [taxon 138948], Coxsackievirus B5 (no rank) [taxon 12074], Enterococcus sp. VB (species) [taxon 1483714], Human enterovirus (species) [taxon 1193974], Coxsackievirus A16 (no rank) [taxon 31704], Enterovirus (genus) [taxon 12059], Coxsackievirus A10 (no rank) [taxon 42769], Orthomyxoviridae (family) [taxon 11308], Enterococcus sp. VD (species) [taxon 1483715], Enterovirus D (no rank) [taxon 138951], Homo sapiens (human, species) [taxon 9606], Enterovirus C (no rank) [taxon 138950]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12944942/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12944942/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/PMC12944942/full.md

---
Source: https://tomesphere.com/paper/PMC12944942