# De novo protein design: a transformative frontier in clinical protein applications

**Authors:** Jie Gao, Zaiyong Zheng, Xueting Yu, Yamei Luo, Yang Yu, Chunxiang Zhang

PMC · DOI: 10.1186/s12967-026-07784-0 · Journal of Translational Medicine · 2026-02-04

## TL;DR

This review discusses how designing proteins from scratch could revolutionize medicine by overcoming traditional protein development limitations.

## Contribution

The paper reviews computational strategies and deep learning's role in advancing de novo protein design for clinical applications.

## Key findings

- De novo protein design offers tailored structures and functions beyond native protein constraints.
- Deep learning has significantly contributed to recent progress in the field.
- Translational barriers and clinical feasibility remain major challenges for clinical application.

## Abstract

Protein biologics are indispensable in disease prevention, diagnosis, and therapy, yet their development remains largely constrained by reliance on native protein scaffolds, resulting in long development timelines, limited structural and functional tunability, challenges in manufacturing consistency, and high production costs.

De novo protein design moves beyond the structural and functional constraints inherent to traditional approaches, enabling the direct creation of proteins with tailored structures and functions and offering a new avenue to address these challenges. In this review, we summarize the principal computational strategies underlying de novo protein design and the contribution of deep learning to its recent progress, and highlight prospective applications, major translational barriers, and the current limitations and future challenges of the field.

Despite notable methodological progress in de novo protein design, its path toward clinical application continues to be limited by a range of biological, technical, and translational considerations. Future work will need closer coordination between computational design, experimental validation, engineering optimization, and clinical needs, with clinical feasibility considered early and refined throughout development.

## Full-text entities

- **Genes:** INS (insulin) [NCBI Gene 3630] {aka IDDM, IDDM1, IDDM2, ILPR, IRDN, MODY10}, Tnfrsf9 (tumor necrosis factor receptor superfamily, member 9) [NCBI Gene 21942] {aka 4-1BB, A930040I11Rik, CDw137, Cd137, ILA, Ly63}, MYC (MYC proto-oncogene, bHLH transcription factor) [NCBI Gene 4609] {aka MRTL, MYCC, bHLHe39, c-Myc}, Ngfr (nerve growth factor receptor (TNFR superfamily, member 16)) [NCBI Gene 18053] {aka LNGFR, Tnfrsf16, p75, p75NGFR, p75NTR}, Il23r (interleukin 23 receptor) [NCBI Gene 209590] {aka IL-23R}, Fcgrt (Fc fragment of IgG receptor and transporter) [NCBI Gene 14132] {aka FcRn}, Ngf (nerve growth factor) [NCBI Gene 18049] {aka Ngfb, beta-NGF}, App (amyloid beta precursor protein) [NCBI Gene 11820] {aka Abeta, Abpp, Adap, Ag, Cvap, E030013M08Rik}, Gcg (glucagon) [NCBI Gene 14526] {aka GLP-1, Glu, PPG}, SNCA (synuclein alpha) [NCBI Gene 6622] {aka NACP, PARK1, PARK4, PD1}, Ntrk1 (neurotrophic tyrosine kinase, receptor, type 1) [NCBI Gene 18211] {aka Tkr, TrkA, trk}, Alb (albumin) [NCBI Gene 11657] {aka Alb-1, Alb1, BCL001, BCL002, BPL001}, Ctla4 (cytotoxic T-lymphocyte-associated protein 4) [NCBI Gene 12477] {aka Cd152, Ctla-4, Ly-56}, Cd274 (CD274 antigen) [NCBI Gene 60533] {aka A530045L16Rik, B7h1, Pdcd1l1, Pdcd1lg1, Pdl1}, Tnfsf9 (tumor necrosis factor (ligand) superfamily, member 9) [NCBI Gene 21950] {aka 4-1BB-L, 4-1BBL, Cd137l, Ly63l}, Tnf (tumor necrosis factor) [NCBI Gene 21926] {aka DIF, TNF-a, TNF-alpha, TNFSF2, TNFalpha, Tnfa}, Insr (insulin receptor) [NCBI Gene 16337] {aka 4932439J01Rik, CD220, D630014A15Rik, IR, IR-A, IR-B}, NUPR1 (nuclear protein 1, transcriptional regulator) [NCBI Gene 26471] {aka COM1, P8}, DNAJC5 (DnaJ heat shock protein family (Hsp40) member C5) [NCBI Gene 80331] {aka CLN4, CLN4B, CSP, DNAJC5A, mir-941-2, mir-941-3}, HTT (huntingtin) [NCBI Gene 3064] {aka HD, IT15, LOMARS}, IDH1 (isocitrate dehydrogenase (NADP(+)) 1) [NCBI Gene 3417] {aka HEL-216, HEL-S-26, IDCD, IDH, IDP, IDPC}, Pth (parathyroid hormone) [NCBI Gene 19226] {aka Pthp}, Tnfrsf1b (tumor necrosis factor receptor superfamily, member 1b) [NCBI Gene 21938] {aka CD120b, TNF-R-II, TNF-R2, TNF-R75, TNF-alphaR2, TNFBR}
- **Diseases:** Alzheimer's disease (MESH:D000544), cancer (MESH:D009369), Huntington's disease (MESH:D006816), inflammation (MESH:D007249), neurodegenerative diseases (MESH:D019636), Diseases (MESH:D004194), syphilis (MESH:D013587), pain (MESH:D010146), pancreatic cancer (MESH:D010190), Parkinson's (MESH:D010300), autoimmune (MESH:D001327), infection (MESH:D007239), Cardiovascular Diseases (MESH:D002318), colitis (MESH:D003092), COVID-19 (MESH:D000086382), toxicity (MESH:D064420), intracerebral hemorrhage (MESH:D002543), malaria (MESH:D008288), oncologic (MESH:D000072716), solid (MESH:D018250), infectious disease (MESH:D003141), hypersensitivity (MESH:D004342), cardiac injury (MESH:D006331)
- **Chemicals:** zinc (MESH:D015032), methotrexate (MESH:D008727), CMC (-), thyroxine (MESH:D013974), glucose (MESH:D005947), calcium (MESH:D002118), polyQ (MESH:C097188)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Escherichia coli (E. coli, species) [taxon 562], Human immunodeficiency virus 1 (no rank) [taxon 11676], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Homo sapiens (human, species) [taxon 9606]
- **Mutations:** Asp707  Ala, cysteine residue at position 62

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12958671/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12958671/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/PMC12958671/full.md

---
Source: https://tomesphere.com/paper/PMC12958671