# Self-awareness of retrosynthesis via chemically inspired contrastive learning for reinforced molecule generation

**Authors:** Yi Zhang, Jindi Huang, Xinze Li, Wenqi Sun, Nana Zhang, Jiquan Zhang, Tiegen Chen, Ling Wang

PMC · DOI: 10.1093/bib/bbaf185 · Briefings in Bioinformatics · 2025-04-21

## TL;DR

A new molecule generation model uses contrastive learning and reinforcement learning to create valid and biologically active compounds for cancer treatment.

## Contribution

A novel molecule generation model that integrates chemically inspired contrastive learning with reinforcement learning to optimize molecular properties.

## Key findings

- The model generates 100% valid and novel molecular structures.
- Generated molecules show fewer structural alerts compared to baselines.
- Molecules demonstrate potent biological activity against ATR and CDK9 targets in experiments.

## Abstract

The recent progress of deep generative models in modeling complex real-world data distributions has enabled the generation of novel compounds with potential therapeutic applications for various diseases. However, most studies fail to optimize the properties of generated molecules from the perspective of the intrinsic nature of chemical reactions. In this work, we propose a novel molecule generation model to overcome the limitation by deep reinforcement learning, in which an agent learns to optimize the properties of molecules initialized with a chemically inspired contrastive pretrained model. We finally assess the generation model by evaluating its ability to generate inhibitors against two prominent therapeutic targets in cancer treatment. Experimental results show that our model could generate 100% valid and novel structures and also exhibits superior performance in generating molecules with fewer structural alerts against several baselines. More importantly, the molecules generated by our proposed model show potent biological activities against ataxia telangiectasia and Rad3-related (ATR) and cyclin-dependent kinase 9 (CDK9) targets in wet-lab experiments.

## Linked entities

- **Proteins:** ATR (ATR checkpoint kinase), CDK9 (cyclin dependent kinase 9)
- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Genes:** ATR (ATR checkpoint kinase) [NCBI Gene 545] {aka FCTCS, FRP1, MEC1, SCKL, SCKL1}, CDK9 (cyclin dependent kinase 9) [NCBI Gene 1025] {aka C-2k, CDC2L4, CTK1, PITALRE, TAK}
- **Diseases:** ataxia telangiectasia (MESH:D001260), cancer (MESH:D009369)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12009711/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12009711/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/PMC12009711/full.md

---
Source: https://tomesphere.com/paper/PMC12009711