Data-Driven Approach to Encoding and Decoding 3-D Crystal Structures
Jordan Hoffmann, Louis Maestrati, Yoshihide Sawada, Jian Tang, Jean, Michel Sellier, Yoshua Bengio

TL;DR
This paper introduces a neural network-based method for encoding and decoding complex 3-D crystal structures, enabling generation, interpolation, and modification of molecular configurations from a large dataset.
Contribution
It presents a novel approach to represent 3-D crystal structures with neural networks, extending generative modeling to complex, large molecules beyond small drug-like compounds.
Findings
Achieved accurate reconstruction of 3-D crystal structures
Enabled interpolation between different molecules
Generated new molecular configurations by sampling latent space
Abstract
Generative models have achieved impressive results in many domains including image and text generation. In the natural sciences, generative models have led to rapid progress in automated drug discovery. Many of the current methods focus on either 1-D or 2-D representations of typically small, drug-like molecules. However, many molecules require 3-D descriptors and exceed the chemical complexity of commonly used dataset. We present a method to encode and decode the position of atoms in 3-D molecules from a dataset of nearly 50,000 stable crystal unit cells that vary from containing 1 to over 100 atoms. We construct a smooth and continuous 3-D density representation of each crystal based on the positions of different atoms. Two different neural networks were trained on a dataset of over 120,000 three-dimensional samples of single and repeating crystal structures, made by rotating the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Protein Structure and Dynamics
