Cut-SOAP: A Machine Learning Descriptor for Rapid Screening of Molecular Adsorption Energetics
Felipe V. Calderan, Karla F. Andriani, Priscilla Felício-Sousa, Gabriel A. Pinheiro, Juarez L. F. Da Silva, Marcos G. Quiles

TL;DR
This paper introduces Cut-SOAP, a machine learning method that quickly and accurately predicts molecular adsorption energies at a fraction of the computational cost of traditional methods.
Contribution
The novel Cut-SOAP descriptor significantly reduces feature dimensionality while maintaining accuracy for adsorption energy prediction.
Findings
Cut-SOAP reduces feature dimensionality by over 97% without significant loss of data quality.
The deep neural network model achieves a mean absolute error below 0.1 eV on standard test sets.
The model maintains robust performance with a mean absolute error below 1.0 eV on out-of-distribution data.
Abstract
Adsorption energy is a fundamental property in catalysis and chemical reaction studies; however, conventional quantum chemistry methods, such as density functional theory, provide high accuracy but are often computationally expensive or even impractical for screening large data sets or complex chemical systems. In this work, we proposed a machine learning (ML) pipeline that efficiently predicts relative energy interactions for molecular adsorption near the minimum molecule–cluster distance, at a fraction of the computational cost of quantum chemistry-based methods. Our approach begins by transforming the Fritz–Haber Institute ab initio materials simulation (FHI-aims) output data into feature arrays through a modified version of the Smooth Overlap of Atomic Positions (SOAP) descriptor, which we call Cut-SOAP. This modification reduces the dimensionality of the features by more than 97%…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Catalysis and Oxidation Reactions · Electrocatalysts for Energy Conversion
