# Machine Learning in Preclinical Development of Antiviral Peptide Candidates: A Review of the Current Landscape

**Authors:** Hannah Hargrove, Bei Tong, Amr Hussein Elkabanny, Xiaohui Frank Zhang

PMC · DOI: 10.3390/v18020260 · Viruses · 2026-02-19

## TL;DR

This review discusses how machine learning is being used to speed up and lower the cost of developing antiviral peptides in preclinical stages.

## Contribution

The paper provides a comprehensive overview of current machine learning applications in antiviral peptide design.

## Key findings

- Machine learning reduces the time and cost of preclinical antiviral peptide screening.
- ML enables exploration of a more diverse chemical space compared to traditional methods.
- In silico methods increase safety by reducing exposure to high BSL-rated viruses.

## Abstract

In the field of antiviral peptide (AVP) design, one of the most prominent limiting factors is the time and material cost required to perform the initial screening of novel AVPs. In particular, traditional target identification as well as traditional preclinical screening of novel drug candidates can be a very lengthy and expensive process. In recent decades, target identification and initial screening of AVPs has been increasingly carried out using machine learning (ML). The use of ML to initially screen potential interactions reduces the financial cost and lengthy time scale of preclinical AVP development, allowing for candidate peptides to be identified and screened faster, at a lower cost to both manufacturer and consumer. Additionally, the use of ML in generating and screening AVP candidates allows a more diverse chemical space to be explored than high-throughput screening methodologies allow. In silico generation and validation of AVP candidates also limits researcher contact with high BSL-rated viruses, thereby increasing the safety and accessibility of AVP design. This review seeks to provide a broad overview of the current uses of ML in early-stage AVP design, and to shed some light on the future direction of the field.

## Full-text entities

- **Genes:** CAMP (cathelicidin antimicrobial peptide) [NCBI Gene 820] {aka CAP-18, CAP18, CRAMP, FALL-39, FALL39, HSD26}, CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, DNAH8 (dynein axonemal heavy chain 8) [NCBI Gene 1769] {aka ATPase, SPGF46, hdhc9}, HDAC5 (histone deacetylase 5) [NCBI Gene 10014] {aka HD5, NY-CO-9}, AVP (arginine vasopressin) [NCBI Gene 551] {aka ADH, ARVP, AVP-NPII, AVRP, VP}, GPT (glutamic--pyruvic transaminase) [NCBI Gene 2875] {aka AAT1, ALT, ALT1, GPT1, SGPT}, S (surface glycoprotein) [NCBI Gene 43740568] {aka spike glycoprotein}, CBLIF (cobalamin binding intrinsic factor) [NCBI Gene 2694] {aka GIF, IF, IFMH, INF, TCN3}, MFSD11 (major facilitator superfamily domain containing 11) [NCBI Gene 79157] {aka ET}, ERVK-6 (endogenous retrovirus group K member 6, envelope) [NCBI Gene 64006] {aka ERVK6, HERV-K(C7), HERV-K108, K-Rev, c-orf, cORF}
- **Diseases:** allergic reaction (MESH:D004342), Drug (MESH:D000081015), nervous system disorders (MESH:D009422), viral (MESH:D014777), ADE (MESH:D064420), infection (MESH:D007239), gastrointestinal disorders (MESH:D005767), COVID-19 (MESH:D000086382), ML (MESH:D007859), and diarrhea (MESH:D003967), headaches, nausea (MESH:D009325), hemolysis (MESH:D006461), influenza (MESH:D007251), inflammation (MESH:D007249), injury to (MESH:D014947)
- **Chemicals:** glutamine (MESH:D005973), threonine (MESH:D013912), lysine (MESH:D008239), hydrogen (MESH:D006859), viscosin (MESH:C062796), tryptophan (MESH:D014364), FIRM (-), oil (MESH:D009821), arginine (MESH:D001120), thiol (MESH:D013438), amino acid (MESH:D000596), peptide (MESH:D010455), phospholipid (MESH:D010743), water (MESH:D014867), AMP (MESH:D000089882), metal (MESH:D008670), fostemsavir (MESH:C576364), proline (MESH:D011392), alanine (MESH:D000409), AVPs (MESH:C035362), carbon (MESH:D002244)
- **Species:** Infectious bronchitis virus (no rank) [taxon 11120], Homo sapiens (human, species) [taxon 9606], Adenoviridae (family) [taxon 10508], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Severe acute respiratory syndrome-related coronavirus (no rank) [taxon 694009], Enterovirus A71 (no rank) [taxon 39054], Human immunodeficiency virus 1 (no rank) [taxon 11676], Zaire ebolavirus (no rank) [taxon 186538], Ebola virus (no rank) [taxon 1570291], Zika virus (no rank) [taxon 64320], Middle East respiratory syndrome-related coronavirus (no rank) [taxon 1335626], Measles morbillivirus (no rank) [taxon 11234], Mus musculus (house mouse, species) [taxon 10090]
- **Cell lines:** HEp-2 — Homo sapiens (Human), Human papillomavirus-related endocervical adenocarcinoma, Cancer cell line (CVCL_1906)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12944945/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12944945/full.md

## References

74 references — full list in the complete paper: https://tomesphere.com/paper/PMC12944945/full.md

---
Source: https://tomesphere.com/paper/PMC12944945