Structure-based Sampling and Self-correcting Machine Learning for Accurate Calculations of Potential Energy Surfaces and Vibrational Levels
Pavlo O. Dral, Alec Owens, Sergei N. Yurchenko, Walter Thiel

TL;DR
This paper introduces a structure-based sampling and self-correcting machine learning approach to efficiently generate accurate potential energy surfaces, significantly reducing the need for costly ab initio calculations in vibrational analysis.
Contribution
It presents a novel structure-based sampling method combined with self-correcting kernel ridge regression to accurately model PESs with minimal high-level ab initio data.
Findings
Achieved accurate PES predictions with only 10% of the data used for training.
Reduced ab initio calculations by up to 90% while maintaining accuracy.
Demonstrated effectiveness on methyl chloride PES with 44,819 points.
Abstract
We present an efficient approach for generating highly accurate molecular potential energy surfaces (PESs) using self-correcting, kernel ridge regression (KRR) based machine learning (ML). We introduce structure-based sampling to automatically assign nuclear configurations from a pre-defined grid to the training and prediction sets, respectively. Accurate high-level \textit{ab initio} energies are required only for the points in the training set, while the energies for the remaining points are provided by the ML model with negligible computational cost. The proposed sampling procedure is shown to be superior to random sampling and also eliminates the need for training several ML models. Self-correcting machine learning has been implemented such that each additional layer corrects errors from the previous layer. The performance of our approach is demonstrated in a case study on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
