Faster and more diverse de novo molecular optimization with double-loop   reinforcement learning using augmented SMILES

Esben Jannik Bjerrum; Christian Margreitter; Thomas Blaschke; Raquel; L\'opez-R\'ios de Castro

arXiv:2210.12458·physics.chem-ph·June 21, 2023·6 cites

Faster and more diverse de novo molecular optimization with double-loop reinforcement learning using augmented SMILES

Esben Jannik Bjerrum, Christian Margreitter, Thomas Blaschke, Raquel, L\'opez-R\'ios de Castro

PDF

Open Access

TL;DR

This paper introduces a double-loop reinforcement learning method with SMILES augmentation to accelerate molecular optimization, enhance diversity, and improve reproducibility in generative models for drug discovery.

Contribution

It proposes a novel double-loop reinforcement learning approach using SMILES augmentation to increase efficiency and diversity in molecular generation tasks.

Findings

01

Optimal augmentation repetitions are between 5 and 10.

02

The method increases diversity and similarity to known ligands.

03

It speeds up learning by reusing scoring calculations.

Abstract

Using generative deep learning models and reinforcement learning together can effectively generate new molecules with desired properties. By employing a multi-objective scoring function, thousands of high-scoring molecules can be generated, making this approach useful for drug discovery and material science. However, the application of these methods can be hindered by computationally expensive or time-consuming scoring procedures, particularly when a large number of function calls are required as feedback in the reinforcement learning optimization. Here, we propose the use of double-loop reinforcement learning with simplified molecular line entry system (SMILES) augmentation to improve the efficiency and speed of the optimization. By adding an inner loop that augments the generated SMILES strings to non-canonical SMILES for use in additional reinforcement learning rounds, we can both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMolecular Junctions and Nanostructures · Quantum-Dot Cellular Automata · Advanced biosensing and bioanalysis techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings