# Transfer learning of hyperparameters for fast construction of anisotropic GPR models: design and application to the machine-learned force field FFLUX

**Authors:** Bienfait K. Isamura, Paul L. A. Popelier

PMC · DOI: 10.1039/d4cp01862a · Physical Chemistry Chemical Physics · 2024-09-03

## TL;DR

This paper introduces a transfer learning method to speed up training of machine learning models used in molecular simulations without losing accuracy.

## Contribution

A novel transfer learning protocol accelerates training of anisotropic GPR models for machine-learned force fields.

## Key findings

- TL models can be built ten times faster than traditional methods while maintaining accuracy.
- TL models perform as well or better than traditional models in molecular simulations.
- The protocol uses seeding–relaxation to identify good hyperparameter solutions efficiently.

## Abstract

The polarisable machine-learned force field FFLUX requires pre-trained anisotropic Gaussian process regression (GPR) models of atomic energies and multipole moments to propagate unbiased molecular dynamics simulations. The outcome of FFLUX simulations is highly dependent on the predictive accuracy of the underlying models whose training entails determining the optimal set of model hyperparameters. Unfortunately, traditional direct learning (DL) procedures do not scale well on this task, especially when the hyperparameter search is initiated from a (set of) random guess solution(s). Additionally, the complexity of the hyperparameter space (HS) increases with the number of geometrical input features, at least for anisotropic kernels, making the optimization of hyperparameters even more challenging. In this study, we propose a transfer learning (TL) protocol that accelerates the training process of anisotropic GPR models by facilitating access to promising regions of the HS. The protocol is based on a seeding–relaxation mechanism in which an excellent guess solution is identified by rapidly building one or several small source models over a subset of the target training set before readjusting the previous guess over the entire set. We demonstrate the performance of this protocol by building and assessing the performance of DL and TL models of atomic energies and charges in various conformations of benzene, ethanol, formic acid dimer and the drug fomepizole. Our experiments suggest that TL models can be built one order of magnitude faster while preserving the quality of their DL analogs. Most importantly, when deployed in FFLUX simulations, TL models compete with or even outperform their DL analogs when it comes to performing FFLUX geometry optimization and computing harmonic vibrational modes.

This work introduces a transfer learning (TL) protocol that accelerates the training of anisotropic Gaussian process regression models while preserving their excellent predictive capability.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11369757/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11369757/full.md

## References

62 references — full list in the complete paper: https://tomesphere.com/paper/PMC11369757/full.md

---
Source: https://tomesphere.com/paper/PMC11369757