Kernel Ridge Regression for conformer ensembles made easy with Structured Orthogonal Random Features
Konstantin Karandashev

TL;DR
This paper introduces a computationally efficient kernel ridge regression method using structured orthogonal random features, tailored for chemical conformer ensembles, demonstrating high accuracy in materials discovery tasks.
Contribution
The paper develops a novel physics-motivated neural network approach based on structured orthogonal random features for efficient machine learning in chemical space.
Findings
Achieved experimental accuracy with hundreds of molecules.
Prediction errors comparable to state-of-the-art methods.
Demonstrated method's flexibility across different molecular representations.
Abstract
A computationally efficient protocol for machine learning in chemical space using Boltzmann ensembles of conformers as input is proposed; the method is based on rewriting Kernel Ridge Regression expressions in terms of Structured Orthogonal Random Features, yielding physics-motivated trigonometric neural networks. To evaluate the method's utility for materials discovery, we test it on experimental datasets of two quantities related to battery electrolyte design, namely oxidation potentials in acetonitrile and hydration energies, using several popular molecular representations to demonstrate the method's flexibility. Despite only using computationally cheap forcefield calculations for conformer generation, we observe systematic decrease of machine learning error with increased training set size in all cases, with experimental accuracy reached after training on hundreds of molecules and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Gaussian Processes and Bayesian Inference
