TL;DR
This paper introduces a generalized SOAP kernel for atomistic machine learning that improves model accuracy and reveals insights into chemical correlations, effectively rediscovering the periodic table through data-driven methods.
Contribution
It presents a novel, generalized SOAP kernel incorporating multi-scale interactions and chemical correlations, enhancing ML model performance and interpretability in chemistry.
Findings
Improved accuracy in molecular and materials stability predictions.
Enhanced handling of complex, multi-component systems.
Machine learning models can rediscover the periodic table structure.
Abstract
Machine-learning of atomic-scale properties amounts to extracting correlations between structure, composition and the quantity that one wants to predict. Representing the input structure in a way that best reflects such correlations makes it possible to improve the accuracy of the model for a given amount of reference data. When using a description of the structures that is transparent and well-principled, optimizing the representation might reveal insights into the chemistry of the data set. Here we show how one can generalize the SOAP kernel to introduce a distance-dependent weight that accounts for the multi-scale nature of the interactions, and a description of correlations between chemical species. We show that this improves substantially the performance of ML models of molecular and materials stability, while making it easier to work with complex, multi-component systems and to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
