Decomposing Chemical Space: Applications to the Machine Learning of Atomic Energies
Frederik {\O}. Kjeldal, Janus J. Eriksen

TL;DR
This paper compares different atomic decomposition schemes on the QM7 dataset to understand how they reveal trends in atomic contributions to molecular energies and their implications for machine learning models.
Contribution
It introduces and evaluates various atomic decomposition methods, highlighting their differences and impacts on machine learning of atomic energies.
Findings
Decomposition schemes reveal diverse atomic contribution patterns.
Some schemes show unphysical basis set dependencies.
Insights inform the design of atomic-energy-based neural network models.
Abstract
We apply a number of atomic decomposition schemes across the standard QM7 dataset -- a small model set of organic molecules at equilibrium geometry -- to inspect the possible emergence of trends among contributions to atomization energies from distinct elements embedded within molecules. Specifically, a recent decomposition scheme of ours based on spatially localized molecular orbitals is compared to alternatives that instead partition molecular energies on account of which nuclei individual atomic orbitals are centred on. We find these partitioning schemes to expose the composition of chemical compound space in very dissimilar ways in terms of the grouping, binning, and heterogeneity of discrete atomic contributions, e.g., those associated with hydrogens bonded to different heavy atoms. Furthermore, unphysical dependencies on the one-electron basis set are found for some, but not all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · History and advancements in chemistry
