Grammars and reinforcement learning for molecule optimization

Egor Kraev

arXiv:1811.11222·cs.LG·November 29, 2018·5 cites

Grammars and reinforcement learning for molecule optimization

Egor Kraev

PDF

Open Access 1 Repo

TL;DR

This paper introduces a simplified method for generating valid SMILES strings using a new grammar and applies reinforcement learning with a Transformer model to optimize molecular properties efficiently, outperforming previous methods.

Contribution

It presents a novel grammar-based approach for valid molecule generation and casts molecule optimization as a reinforcement learning problem with improved efficiency and performance.

Findings

01

Outperforms previous state-of-the-art baselines

02

Enables larger molecule generation with fewer model steps

03

Applicable to graph structure optimization with complex constraints

Abstract

We seek to automate the design of molecules based on specific chemical properties. Our primary contributions are a simpler method for generating SMILES strings guaranteed to be chemically valid, using a combination of a new context-free grammar for SMILES and additional masking logic; and casting the molecular property optimization as a reinforcement learning problem, specifically best-of-batch policy gradient applied to a Transformer model architecture. This approach uses substantially fewer model steps per atom than earlier approaches, thus enabling generation of larger molecules, and beats previous state-of-the art baselines by a significant margin. Applying reinforcement learning to a combination of a custom context-free grammar with additional masking to enforce non-local constraints is applicable to any optimization of a graph structure under a mixture of local and nonlocal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ZmeiGorynych/generative_playground
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Chemical Synthesis and Analysis

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Softmax