# Gene Expression, Docking and Machine Learning in Malaria Drug Discovery: A Systematic Review

**Authors:** Reuben Samson Dangana, Israel Ehizuelen Ebhohimen, Samson Anjikwi Malgwi, Samuel Chima Ugbaja, Moses Okpeku

PMC · DOI: 10.1155/bri/2724332 · Biochemistry Research International · 2026-02-26

## TL;DR

This review explores how gene expression, molecular docking, and machine learning are being used to discover new malaria drugs, focusing on herbal compounds and computational methods.

## Contribution

The paper systematically reviews the integration of molecular and computational techniques for antimalarial drug discovery from 2014 to 2024.

## Key findings

- Molecular docking was the most used technique, with compounds like isorhamnetin showing strong binding to Plasmodium proteins.
- ML models predicted bioactivity and resistance patterns, identifying flavonoids and terpenoids as promising drug candidates.
- RNA-seq analysis revealed key genes modulated by herbal treatments, including those involved in apoptosis and inflammation.

## Abstract

Malaria remains a significant and worldwide health threat with increasing resistance to current treatments, stimulating the demand for innovative approaches in pursuing drug discovery. This systematic review integrates the progress made from 2014 through 2024 regarding molecular methods like gene expression profiling, molecular docking and machine learning to understand the biology of Plasmodium and identify new drug targets and compounds, focusing on herbal remedies and computational methods.

Several studies were found using a PRISMA‐guided search of PubMed, Scopus and Web of Science (64 studies found). The data extracted were gene expression outcomes, docking affinities, ML models and experimental validations (in vitro/in vivo).

Molecular docking emerged as the dominant technique (32.37%), followed by in vitro antiplasmodial assays (14.39%), ADMET profiling (10.79%) and gene expression studies (3.60%). RNA‐seq analysis revealed key host and parasite genes modulated by herbal treatments, including those involved in apoptosis and inflammation. Notably, compounds like isorhamnetin and myricetin 3‐O‐glucoside showed exceptionally high binding affinities to Plasmepsin II and Plasmodium falciparum lactate dehydrogenase (PfLDH) (ΔG < −13 kcal/mol). ML models like random forest and support vector machine (SVM) exhibited high predictive results (AUC value up to 0.87) for bioactivity and resistance patterns that showed flavonoids (quercetin) and terpenoids (eugenol) as good candidates. Pathways that are often attacked are haemoglobin degradation, glycolysis, pyrimidine metabolism and protein synthesis.

Multiomics, docking and ML integration improve the target identification and prioritise the compounds. This review illustrates the great potential of molecular techniques for the development of drugs against antimalarial helicases that are not resistant to drug therapy. However, in vivo data holes and methodology inconsistency limit clinical translation. Future work should include standardisation of protocols and studies of synergistic combinations of phytochemicals.

## Linked entities

- **Chemicals:** isorhamnetin (PubChem CID 5281654), myricetin 3-O-glucoside (PubChem CID 5318606), quercetin (PubChem CID 5280343), eugenol (PubChem CID 3314)
- **Diseases:** malaria (MONDO:0005136)
- **Species:** Plasmodium (taxon 5820)

## Full-text entities

- **Genes:** HDGFL2 (HDGF like 2) [NCBI Gene 84717] {aka HDGF-2, HDGF2, HDGFRP2, HRP-2, HRP2}, Actin [NCBI Gene 3427973], DNA topoisomerase II [NCBI Gene 3424932], Choline kinase [NCBI Gene 3423063]
- **Diseases:** inflammation (MESH:D007249), PfLDH (MESH:D016778), cardiotoxicity (MESH:D066126), deaths (MESH:D003643), Cytotoxicity (MESH:D064420), cerebral malaria (MESH:D016779), infections (MESH:D007239), Coma (MESH:D003128), ADMET (MESH:C562790), infectious diseases (MESH:D003141), Malaria (MESH:D008288)
- **Chemicals:** artemisinin B (MESH:C073954), aloesaponarin I (MESH:C584681), bergenin (MESH:C006741), isovitexin (MESH:C049772), sesamin (MESH:C054125), Cosmosiin (MESH:C057792), PBSA (MESH:C437084), Trolox (MESH:C010643), eucalyptol (MESH:D000077591), substances (MESH:C012600), Limonin (MESH:C001546), limonene (MESH:D000077222), caffeoylquinic acids (MESH:C472707), betulinic acid (MESH:D000094062), rutin (MESH:D012431), pinocembrin (MESH:C016063), andrographolide (MESH:C030419), quercetin (MESH:D011794), friedelin (MESH:C060796), myricetin (MESH:C040015), ursolic acid (MESH:C005466), EOs (MESH:D009822), thiobarbituric acid (MESH:C029684), dimethylmatairesinol (MESH:C508933), terpenoids (MESH:D013729), afzelin (MESH:C477954), curcuminoid (MESH:D036381), alpha-pinene (MESH:C005451), kaempferol (MESH:C006552), artesunate (MESH:D000077332), Pyrimidine (MESH:C030986), glycyrrhizin (MESH:D019695), lupeol (MESH:C010480), artemisinin (MESH:C031327), epicatechin-gallate (MESH:C062669), ascorbic acid (MESH:D001205), 2BJU (-), purfalcamine (MESH:C531239), rhaponticin (MESH:C023538), myricitrin (MESH:C008577), halofuginone (MESH:C010176), phosphatidylcholine (MESH:D010713), Fatty acid (MESH:D005227), isoquercitrin (MESH:C016527), astragalin (MESH:C001579), amino acid (MESH:D000596), MTT (MESH:C070243), lipid (MESH:D008055), quercitrin (MESH:C012526), purine (MESH:C030985), beta-carboline (MESH:C010262), cynaroside (MESH:C066408), Knipholone (MESH:C403586), SYBR Green I (MESH:C098022), scoparone (MESH:C018145), Flavonoids (MESH:D005419), eugenol (MESH:D005054), scopoletin (MESH:D012603), 11-O-galloylbergenin (MESH:C556377), calcium (MESH:D002118)
- **Species:** Plasmodium berghei ANKA (strain) [taxon 5823], Xanthium strumarium var. canadense (varietas) [taxon 552636], Plasmodium berghei (species) [taxon 5821], Saccharomyces cerevisiae (baker's yeast, species) [taxon 4932], Terminalia albida (species) [taxon 2801321], Morus alba (white mulberry, species) [taxon 3498], Garcinia mangostana (mangosteen, species) [taxon 58228], Aloe marlothii (species) [taxon 992641], Mus musculus (house mouse, species) [taxon 10090], Artemisia annua (sweet Annie, species) [taxon 35608], Euphorbia hirta (asthma-plant, species) [taxon 318062], Hibiscus cannabinus (kenaf, species) [taxon 229543], Danio rerio (leopard danio, species) [taxon 7955], Artocarpus (genus) [taxon 3488], Plasmodium falciparum (malaria parasite P. falciparum, species) [taxon 5833], Andrographis paniculata (species) [taxon 175694], Anopheles gambiae (African malaria mosquito, species) [taxon 7165], Cordia myxa (Assyrian-plum, species) [taxon 181185], Plasmodium vivax (malaria parasite P. vivax, species) [taxon 5855], Corchorus capsularis (jute, species) [taxon 210143]
- **Cell lines:** 3D7 — Mus musculus (Mouse), Hybridoma (CVCL_KS87), NF54 — Homo sapiens (Human), Ovarian carcinosarcoma, Cancer cell line (CVCL_W770), Caco-2 — Homo sapiens (Human), Colon adenocarcinoma, Cancer cell line (CVCL_0025)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12945668/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12945668/full.md

## References

86 references — full list in the complete paper: https://tomesphere.com/paper/PMC12945668/full.md

---
Source: https://tomesphere.com/paper/PMC12945668