Atom-Density Representations for Machine Learning
Michael J. Willatt, Felix Musil, Michele Ceriotti

TL;DR
This paper introduces a unified, basis set independent framework for atomic environment representations in machine learning, leveraging smoothed atomic densities and inner products to improve efficiency and systematic tuning.
Contribution
It formalizes a basis set independent, density-based representation of atomic environments, connecting existing methods and enabling systematic optimization for materials and molecules.
Findings
Provides a formalism connecting SOAP and real-space correlations.
Lays groundwork for systematic tuning of representations.
Unifies recent developments in atomic environment representations.
Abstract
The applications of machine learning techniques to chemistry and materials science become more numerous by the day. The main challenge is to devise representations of atomic systems that are at the same time complete and concise, so as to reduce the number of reference calculations that are needed to predict the properties of different types of materials reliably. This has led to a proliferation of alternative ways to convert an atomic structure into an input for a machine-learning model. We introduce an abstract definition of chemical environments that is based on a smoothed atomic density, using a bra-ket notation to emphasize basis set independence and to highlight the connections with some popular choices of representations for describing atomic systems. The correlations between the spatial distribution of atoms and their chemical identities are computed as inner products between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
