In silico generation of novel, drug-like chemical matter using the LSTM neural network
Peter Ertl, Richard Lewis, Eric Martin, Valery Polyakov

TL;DR
This paper presents a method using LSTM neural networks to generate a large number of novel, drug-like molecules efficiently, with promising properties and bioactivity potential, supporting drug discovery efforts.
Contribution
The study introduces a novel LSTM-based approach for rapid generation of diverse, drug-like molecules with favorable properties, expanding the chemical space for drug discovery.
Findings
Generated one million molecules in 2 hours
Molecules are diverse and maintain drug-like features
Virtual screening shows bioactivity potential comparable to known bioactive molecules
Abstract
The exploration of novel chemical spaces is one of the most important tasks of cheminformatics when supporting the drug discovery process. Properly designed and trained deep neural networks can provide a viable alternative to brute-force de novo approaches or various other machine-learning techniques for generating novel drug-like molecules. In this article we present a method to generate molecules using a long short-term memory (LSTM) neural network and provide an analysis of the results, including a virtual screening test. Using the network one million drug-like molecules were generated in 2 hours. The molecules are novel, diverse (contain numerous novel chemotypes), have good physicochemical properties and have good synthetic accessibility, even though these qualities were not specific constraints. Although novel, their structural features and functional groups remain closely within…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Protein Structure and Dynamics
