# A Machine Learning-Enabled Venom Peptide Platform for Rapid Drug Discovery

**Authors:** Fei Cai, Lijuan Zhou, Bryce Delgado, Wenping Chang, Jeffrey Tom, Evelyn Hernandez, Prajakta Joshi, Aimin Song, Matthieu Masureel, Henry R. Maun, Andrew Chang, Yingnan Zhang

PMC · DOI: 10.3390/ph19020288 · Pharmaceuticals · 2026-02-09

## TL;DR

This paper introduces a machine learning-powered platform for discovering venom peptides that can bind to complex proteins, offering a new way to develop small, stable therapeutic molecules.

## Contribution

A novel venom peptide discovery platform combining phage display, machine learning design, and recombinant expression for rapid identification of drug candidates.

## Key findings

- The VCX library successfully identified binders for four diverse targets with a 100% success rate.
- ML-assisted affinity maturation enabled rapid identification of DLL3 binder leads.
- The platform produces small, stable 'antibody-like' peptides suitable for targeting complex proteins.

## Abstract

Background/Objectives: Nature has evolved millions of venom-derived peptides with diverse biological functions, a substantial fraction of which target complex membrane proteins such as G-protein-coupled receptors and ion channels. Many of these peptides are stabilized by multiple disulfide bonds, endowing them with exceptional structural stability and favorable pharmacological properties. Methods: Leveraging this natural diversity, we developed a robust venom peptide therapeutics discovery system built on phage display technology and constructed a library using approximately 482 venom-derived scaffolds. The library design was guided by a machine learning (ML) model capable of predicting mutation-tolerant residues that preserve peptide foldability, maximizing structural integrity and sequence diversity. Results: The resulting VCX library was evaluated through screening against four diverse targets (CD47, DLL3, IL33, and P2X7R), yielding strong binders for all four, a success rate of 100%. Furthermore, by integrating high-throughput recombinant expression of thioredoxin–venom fusion proteins along with ML-assisted affinity maturation, we rapidly identified potential leads for DLL3 binders. Conclusions: This venom-based discovery platform offers significant advantages in both functionality and developability compared with conventional peptide discovery approaches. By combining natural structural diversity, ML-guided design, and recombinant expression, it enables efficient identification of “antibody-like” binders with molecular weights much smaller than those of antibodies. Consequently, it provides a powerful strategy for developing next-generation peptide therapeutics targeting challenging protein–protein interactions and complex membrane proteins.

## Linked entities

- **Proteins:** CD47 (CD47 molecule), DLL3 (delta like canonical Notch ligand 3), IL33 (interleukin 33), P2rx7 (purinergic receptor P2X, ligand-gated ion channel, 7), TRX1 (thioredoxin H-type 1)

## Full-text entities

- **Genes:** IL33 (interleukin 33) [NCBI Gene 90865] {aka C9orf26, DVS27, IL1F11, NF-HEV, NFEHEV}, CD47 (CD47 molecule) [NCBI Gene 961] {aka IAP, MER6, OA3}, CD33 (CD33 molecule) [NCBI Gene 945] {aka CD33rSiglec, SIGLEC-3, SIGLEC3, p67}, DLL3 (delta like canonical Notch ligand 3) [NCBI Gene 10683] {aka SCDO1}, TXN (thioredoxin) [NCBI Gene 7295] {aka TRDX, TRX, TRX1, TXN1, Trx80}
- **Diseases:** ML (MESH:D007859), inflammation (MESH:D007249), injury to (MESH:D014947), tumor (MESH:D009369), chronic pain (MESH:D059350), hypertension (MESH:D006973)
- **Chemicals:** Peptide (MESH:D010455), Exenatide (MESH:D000077270), NiCl2 (MESH:C022838), imidazole (MESH:C029899), HCl (MESH:D006851), SDS (MESH:D012967), carbenicillin (MESH:D002228), CaCl2 (MESH:D002122), biotin (MESH:D001710), NTA (MESH:D009571), TCEP (MESH:C080938), NaCl (MESH:D012965), metal (MESH:D008670), polyacrylamide (MESH:C016679), NNK (MESH:C016583), acetonitrile (MESH:C032159), nitrogen (MESH:D009584), Ni (MESH:D009532), Cys (MESH:D003545), lipid (MESH:D008055), IPTG (MESH:D007544), glutathione (MESH:D005978), ATP (MESH:D000255), DMSO (MESH:D004121), STA (MESH:C009695), PBS (MESH:D007854), disulfide (MESH:D004220), SA (MESH:D000077145), 9-fluorenylmethyloxycarbonyl (-), TFA (MESH:D014269), oxidized glutathione (MESH:D019803), ammonium bicarbonate (MESH:C027043), amino acid (MESH:D000596), Captopril (MESH:D002216), urea (MESH:D014508)
- **Species:** Escherichia coli (E. coli, species) [taxon 562], Trichoplusia ni (cabbage looper, species) [taxon 7111], Homo sapiens (human, species) [taxon 9606], Heloderma suspectum (Gila monster, species) [taxon 8554]
- **Mutations:** C33S
- **Cell lines:** M13 — Mus musculus (Mouse), Hybridoma (CVCL_B0XQ), Sf9 — Spodoptera frugiperda (Fall armyworm), Spontaneously immortalized cell line (CVCL_0549), Expi293F — Homo sapiens (Human), Transformed cell line (CVCL_D615), BL21(DE3) — Mus musculus (Mouse), Hybridoma (CVCL_B7HM)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12943374/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12943374/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/PMC12943374/full.md

---
Source: https://tomesphere.com/paper/PMC12943374