GeoMol: Torsional Geometric Generation of Molecular 3D Conformer Ensembles
Octavian-Eugen Ganea, Lagnajit Pattanaik, Connor W. Coley, Regina, Barzilay, Klavs F. Jensen, William H. Green, Tommi S. Jaakkola

TL;DR
GeoMol is an innovative end-to-end machine learning model that efficiently generates diverse, low-energy 3D conformer ensembles of molecules by predicting local structures and torsion angles, improving accuracy and speed over existing methods.
Contribution
It introduces a non-autoregressive, SE(3)-invariant approach using message passing neural networks to predict local geometries and torsion angles, avoiding over-parameterization and enhancing conformer diversity.
Findings
Outperforms existing models in accuracy and speed
Generates diverse low-energy conformers effectively
Leverages message passing neural networks for local structure prediction
Abstract
Prediction of a molecule's 3D conformer ensemble from the molecular graph holds a key role in areas of cheminformatics and drug discovery. Existing generative models have several drawbacks including lack of modeling important molecular geometry elements (e.g. torsion angles), separate optimization stages prone to error accumulation, and the need for structure fine-tuning based on approximate classical force-fields or computationally expensive methods such as metadynamics with approximate quantum mechanics calculations at each geometry. We propose GeoMol--an end-to-end, non-autoregressive and SE(3)-invariant machine learning approach to generate distributions of low-energy molecular 3D conformers. Leveraging the power of message passing neural networks (MPNNs) to capture local and global graph information, we predict local atomic 3D structures and torsion angles, avoiding unnecessary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Protein Structure and Dynamics
