Generating stable molecules using imitation and reinforcement learning

S{\o}ren Ager Meldgaard; Jonas K\"ohler; Henrik Lund Mortensen,; Mads-Peter V. Christiansen; Frank No\'e; Bj{\o}rk Hammer

arXiv:2107.05007·physics.chem-ph·July 13, 2021

Generating stable molecules using imitation and reinforcement learning

S{\o}ren Ager Meldgaard, Jonas K\"ohler, Henrik Lund Mortensen,, Mads-Peter V. Christiansen, Frank No\'e, Bj{\o}rk Hammer

PDF

TL;DR

This paper introduces a reinforcement learning method that generates stable molecules in 3D space, improving the discovery of low-energy, stable compounds by combining imitation learning and quantum chemical predictions.

Contribution

It presents a novel reinforcement learning framework that generates molecules in Cartesian coordinates, incorporating quantum chemical stability predictions, and enhances sample efficiency through imitation learning.

Findings

01

Successfully identifies low-energy molecules in the database.

02

Generates novel isomers not present in training data.

03

Refines molecule generation for larger, more complex molecules.

Abstract

Chemical space is routinely explored by machine learning methods to discover interesting molecules, before time-consuming experimental synthesizing is attempted. However, these methods often rely on a graph representation, ignoring 3D information necessary for determining the stability of the molecules. We propose a reinforcement learning approach for generating molecules in cartesian coordinates allowing for quantum chemical prediction of the stability. To improve sample-efficiency we learn basic chemical rules from imitation learning on the GDB-11 database to create an initial model applicable for all stoichiometries. We then deploy multiple copies of the model conditioned on a specific stoichiometry in a reinforcement learning setting. The models correctly identify low energy molecules in the database and produce novel isomers not found in the training set. Finally, we apply the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.