# A case study of the application of AI to early stage drug discovery

**Authors:** Abbi Abdel-Rehim, Larisa N. Soldatova, Ross D. King

PMC · DOI: 10.1038/s41598-025-32805-1 · Scientific Reports · 2025-12-26

## TL;DR

This study shows that ChatGPT can help design drug candidates early in the discovery process, producing molecules with promising predicted activity.

## Contribution

Demonstrates the use of a general-purpose AI (ChatGPT) for early-stage drug discovery tasks, including molecular ideation and candidate prioritization.

## Key findings

- ChatGPT generated EGFR inhibitors with predicted IC₅₀ values of ~10–50 nM after iterative optimization.
- A single attempt produced a de novo EGFR inhibitor with a predicted IC₅₀ of 94 nM.
- A non-covalent MCL1 inhibitor candidate achieved a docking score corresponding to a 39 nM dissociation constant.

## Abstract

Artificial intelligence (AI) has emerged as a powerful tool in drug discovery, offering the potential to expedite the design of novel therapeutics. This study evaluates the effectiveness of a general-purpose conversational AI, ChatGPT (GPT-4o), in performing three distinct drug discovery tasks, assessing its ability to assist in early-stage molecular ideation and design. In the first task, ChatGPT generated molecules starting from five low-affinity EGFR inhibitors (IC₅₀ values of 10–3.16 µM), which were iteratively optimized in a QSAR model to produce compounds with predicted IC₅₀ values of ~ 10–50 nM. In the second task, de novo design of EGFR inhibitors produced a molecule with a predicted IC₅₀ of 94 nM in a single attempt. In the third task, ChatGPT generated non-covalent MCL1 inhibitors, with a top candidate achieving a docking score corresponding to a 39 nM dissociation constant. Because AI-generated molecules often face synthetic feasibility challenges, we also identified readily available analogues from a chemical vendor. These analogues were evaluated using molecular docking (AutoDock Vina) and QSAR models, with several achieving a promising activity range of 10–100 nM across the three tasks. These results demonstrate that general-purpose AI models like ChatGPT can accelerate early-stage drug discovery by assisting in molecular ideation and candidate prioritization.

The online version contains supplementary material available at 10.1038/s41598-025-32805-1.

## Linked entities

- **Proteins:** EGFR (epidermal growth factor receptor), MCL1 (MCL1 apoptosis regulator, BCL2 family member)

## Full-text entities

- **Genes:** BCL2 (BCL2 apoptosis regulator) [NCBI Gene 596] {aka Bcl-2, PPP1R50}, EGFR (epidermal growth factor receptor) [NCBI Gene 1956] {aka ERBB, ERBB1, ERRP, HER1, NISBD2, NNCIS}, GPT (glutamic--pyruvic transaminase) [NCBI Gene 2875] {aka AAT1, ALT, ALT1, GPT1, SGPT}, PRRT2 (proline rich transmembrane protein 2) [NCBI Gene 112476] {aka BFIC2, BFIS2, DSPB3, DYT10, EKD1, FICCA}, MCL1 (MCL1 apoptosis regulator, BCL2 family member) [NCBI Gene 4170] {aka BCL2L3, EAT, MCL1-ES, MCL1L, MCL1S, Mcl-1}
- **Diseases:** tumor (MESH:D009369), malaria (MESH:D008288), LLMs (MESH:D007806), toxicity (MESH:D064420)
- **Chemicals:** nitrogen (MESH:D009584), nitriles (MESH:D009570), quinazoline (MESH:D011799), benzimidazoles (MESH:D001562), ChatGPT (-), AZD5991 (MESH:C000629704), benzene (MESH:D001554), benzamide (MESH:C037689), sulfone (MESH:D013450), thiophene (MESH:D013876), MMFF (MESH:C067067), thiourea (MESH:D013890), Water (MESH:D014867), ATP (MESH:D000255), benzimidazole (MESH:C031000), bromine (MESH:D001966), Piperazine (MESH:D000077489), hydrogen (MESH:D006859), Tapotoclax (MESH:C000720001)
- **Mutations:** C797S, T790M

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12827298/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12827298/full.md

## References

5 references — full list in the complete paper: https://tomesphere.com/paper/PMC12827298/full.md

---
Source: https://tomesphere.com/paper/PMC12827298