Chemical Space-Informed Machine Learning Models for Rapid Predictions of X-ray Photoelectron Spectra of Organic Molecules
Susmita Tripathy, Surajit Das, Shweta Jindal, Raghunathan, Ramakrishnan

TL;DR
This study develops machine learning models using kernel-ridge regression and transfer learning to rapidly predict X-ray photoelectron spectra of organic molecules, enabling efficient virtual screening with reasonable accuracy.
Contribution
The paper introduces a cost-effective ML framework combining kernel-ridge regression, transfer learning, and baseline spectra to predict core-electron binding energies for organic molecules.
Findings
Models show strong linear correlation with target CEBEs.
Applicable to large, substituted molecules beyond training set.
Dataset and models are available as an open Python module.
Abstract
We present machine learning models based on kernel-ridge regression for predicting X-ray photoelectron spectra of organic molecules originating from the -shell ionization energies of carbon (C), nitrogen (N), oxygen (O), and fluorine (F) atoms. We constructed the training dataset through high-throughput calculations of -shell core-electron binding energies (CEBEs) for 12,880 small organic molecules in the bigQM7 dataset, employing the -SCF formalism coupled with meta-GGA-DFT and a variationally converged basis set. The models are cost-effective, as they require the atomic coordinates of a molecule generated using universal force fields while estimating the target-level CEBEs corresponding to DFT-level equilibrium geometry. We explore transfer learning by utilizing the atomic environment feature vectors learned using a graph neural network framework in kernel-ridge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Advanced Chemical Sensor Technologies · Various Chemistry Research Topics
