Molecular Energy Learning Using Alternative Blackbox Matrix-Matrix Multiplication Algorithm for Exact Gaussian Process
Jiace Sun, Lixue Cheng, Thomas F. Miller III

TL;DR
This paper introduces an efficient alternative blackbox matrix-matrix multiplication algorithm to scale Gaussian Process training for molecular energies, achieving over 30-fold larger datasets with maintained accuracy.
Contribution
It proposes a new implementation of BBMM (AltBBMM) that significantly speeds up MOB-ML training without sacrificing accuracy or transferability.
Findings
Over 30 times scaling of MOB-ML training to 6500 molecules.
Maintains state-of-the-art accuracy in low-data regimes.
Achieves better accuracy than existing ML methods on molecular energies.
Abstract
We present an application of the blackbox matrix-matrix multiplication (BBMM) algorithm to scale up the Gaussian Process (GP) training of molecular energies in the molecular-orbital based machine learning (MOB-ML) framework. An alternative implementation of BBMM (AltBBMM) is also proposed to train more efficiently (over four-fold speedup) with the same accuracy and transferability as the original BBMM implementation. The training of MOB-ML was limited to 220 molecules, and BBMM and AltBBMM scale the training of MOB-ML up by over 30 times to 6500 molecules (more than a million pair energies). The accuracy and transferability of both algorithms are examined on the benchmark datasets of organic molecules with 7 and 13 heavy atoms. These lower-scaling implementations of the GP preserve the state-of-the-art learning efficiency in the low-data regime while extending it to the large-data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Gaussian Processes and Bayesian Inference
MethodsGaussian Process
