ML Models of Vibrating H$_2$CO: Comparing Reproducing Kernels, FCHL and PhysNet
Silvan K\"aser, Debasish Koner, Anders S. Christensen, O., Anatole von Lilienfeld, Markus Meuwly

TL;DR
This study compares various machine learning models, including neural networks and kernel methods, for predicting molecular properties of formaldehyde, showing their strengths and limitations in accuracy, extrapolation, and vibrational spectra.
Contribution
It provides a comprehensive comparison of state-of-the-art ML models for molecular simulations, highlighting their convergence behavior and transfer learning capabilities.
Findings
Neural networks converge faster for energies and forces.
Kernel methods excel in extrapolation to new geometries.
All models perform well in reproducing infrared spectra.
Abstract
Machine Learning (ML) has become a promising tool for improving the quality of atomistic simulations. Using formaldehyde as a benchmark system for intramolecular interactions, a comparative assessment of ML models based on state-of-the-art variants of deep neural networks (NN), reproducing kernel Hilbert space (RKHS+F), and kernel ridge regression (KRR) is presented. Learning curves for energies and atomic forces indicate rapid convergence towards excellent predictions for B3LYP, MP2, and CCSD(T)-F12 reference results for modestly sized (in the hundreds) training sets. Typically, learning curve off-sets decay as one goes from NN (PhysNet) to RKHS+F to KRR (FCHL). Conversely, the predictive power for extrapolation of energies towards new geometries increases in the same order with RKHS+F and FCHL performing almost equally. For harmonic vibrational frequencies, the picture is less clear,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
