# Nature meets machine: the AI renaissance in natural product drug discovery

**Authors:** Rajesh Muthuraj, Jaikanth Chandrasekaran

PMC · DOI: 10.1007/s13659-025-00589-6 · Natural Products and Bioprospecting · 2026-03-02

## TL;DR

This paper explores how AI can enhance natural product drug discovery by addressing traditional challenges and enabling more efficient exploration of nature-derived compounds.

## Contribution

The paper introduces a framework for integrating AI across natural product discovery pipelines, highlighting novel applications like machine learning-driven dereplication and genome mining.

## Key findings

- AI tools like GNPS2 demonstrate scalable progress in natural product discovery workflows.
- AI integration accelerates early-stage discovery and improves translational relevance in antibiotic and anticancer research.
- Emerging AI paradigms, such as quantum machine learning, may further expand natural product research.

## Abstract

Natural products (NPs) have long served as a cornerstone of drug discovery, yielding landmark therapeutics such as paclitaxel and artemisinin and providing sustained access to biologically relevant chemical space. Despite this legacy, NP-based discovery has gradually declined with the rise of synthetic chemistry and high-throughput screening, even as many contemporary “synthetic” drugs remain structurally inspired by natural scaffolds. Classical NP workflows—centered on phenotypic screening and bioassay-guided fractionation—continue to face persistent bottlenecks, including structural complexity, low bioactive yield, frequent rediscovery, and limited scalability. Rather than competing with NP research, artificial intelligence (AI) offers a complementary methodological framework to address these longstanding challenges. This review critically examines the bottlenecks inherent to traditional NP discovery and outlines how AI can be systematically integrated across the pipeline. We discuss AI-enabled advances ranging from natural language processing for mining ethnopharmacological knowledge to machine learning–driven dereplication, cheminformatics, and genome mining, with platforms such as GNPS2 exemplifying scalable progress. Case studies in antibiotic and anticancer discovery, as well as the modernization of traditional medicine, illustrate how AI–NP integration can accelerate early-stage discovery while enhancing translational relevance. Looking ahead, we examine emerging paradigms—including quantum machine learning, federated data ecosystems, and AI-assisted molecular design—that may further expand the scope of NP-based research. Collectively, this review presents a forward-looking framework in which AI functions not as a replacement for NP science, but as a synergistic discipline that enables more efficient, scalable, and informed exploration of nature-derived chemical diversity.

The online version contains supplementary material available at 10.1007/s13659-025-00589-6.

## Linked entities

- **Chemicals:** paclitaxel (PubChem CID 36314), artemisinin (PubChem CID 68827)

## Full-text entities

- **Genes:** NINL (ninein like) [NCBI Gene 22981] {aka NLP}, KRAS (KRAS proto-oncogene, GTPase) [NCBI Gene 3845] {aka 'C-K-RAS, C-K-RAS, CFC2, K-RAS2A, K-RAS2B, K-RAS4A}, FLT3LG (fms related receptor tyrosine kinase 3 ligand) [NCBI Gene 2323] {aka FL, FLG3L, FLT3L, IMD125}, COX1 (cytochrome c oxidase subunit I) [NCBI Gene 4512] {aka COI, MTCO1}
- **Diseases:** cancer (MESH:D009369), microtubule-disrupting (MESH:D019958), neurodegeneration (MESH:D019636), inflammatory (MESH:D007249), NP (MESH:D012893), NTD (MESH:D009436), non-small cell lung cancer (MESH:D002289), LLMs (MESH:D007806), AI (MESH:C538142), NTDs (MESH:D058069), ML (MESH:D007859), infection (MESH:D007239), COVID-19 (MESH:D000086382), chronic cough (MESH:D003371), toxicity (MESH:D064420), infectious diseases (MESH:D003141)
- **Chemicals:** marinopyrrole A (MESH:C528298), Taxol (MESH:D017239), vincristine (MESH:D014750), phorbol 12-myristate 13-acetate (MESH:D013755), quercetin (MESH:D011794), Simalikalactone E (MESH:C582753), beta-lapachone (MESH:C014638), carbon (MESH:D002244), AMPs (MESH:D000089882), artemisinin (MESH:C031327), alkaloid (MESH:D000470), polyketides (MESH:D061065), Halicin (MESH:C000717882), carbapenem (MESH:D015780), terpene (MESH:D013729), DP4-AI (-), curcumin (MESH:D003474), bruceine D (MESH:C030412), eleutherobin (MESH:C109551), morphine (MESH:D009020), hydrogen (MESH:D006859), flavonoids (MESH:D005419), Glycosides (MESH:D006027), Saponins (MESH:D012503), polyphenols (MESH:D059808), ATP (MESH:D000255), cysteine (MESH:D003545)
- **Species:** Scutellaria baicalensis (Baikal skullcap, species) [taxon 65409], Elephas antiquus (species) [taxon 251093], Escherichia coli (E. coli, species) [taxon 562], Papaver somniferum (opium poppy, species) [taxon 3469], Quassia amara (Amargo, species) [taxon 43725], Mammuthus primigenius (mammoth, species) [taxon 37349], Mycobacterium tuberculosis (species) [taxon 1773], Homo sapiens (human, species) [taxon 9606], Taxus brevifolia (Pacific yew, species) [taxon 46220], Trypanosoma (genus) [taxon 5690], Curcuma longa (turmeric, species) [taxon 136217], Corylus avellana (European hazelnut, species) [taxon 13451], Enterobacteriaceae (enterobacteria, family) [taxon 543], Acinetobacter baumannii (species) [taxon 470]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12953908/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12953908/full.md

## References

4 references — full list in the complete paper: https://tomesphere.com/paper/PMC12953908/full.md

---
Source: https://tomesphere.com/paper/PMC12953908