Training Algorithm Matters for the Performance of Neural Network   Potential: A Case Study of Adam and the Kalman Filter Optimizers

Yunqi Shao; Florian M. Dietrich; Carl Nettelblad; Chao Zhang

arXiv:2109.03769·physics.chem-ph·December 15, 2021

Training Algorithm Matters for the Performance of Neural Network Potential: A Case Study of Adam and the Kalman Filter Optimizers

Yunqi Shao, Florian M. Dietrich, Carl Nettelblad, Chao Zhang

PDF

TL;DR

This study compares the effectiveness of Adam and EKF training algorithms for neural network potentials, revealing EKF's superior transferability and robustness, with performance linked to Fisher information rather than validation error.

Contribution

It introduces the implementation of EKF in TensorFlow for training neural network potentials and compares its performance to Adam using water datasets.

Findings

01

EKF-trained NNPs are more transferable.

02

EKF is less sensitive to learning rate variations.

03

Performance correlates with Fisher information measure.

Abstract

One hidden yet important issue for developing neural network potentials (NNPs) is the choice of training algorithm. Here we compare the performance of two popular training algorithms, the adaptive moment estimation algorithm (Adam) and the Extended Kalman Filter algorithm (EKF), using the Behler-Parrinello neural network (BPNN) and two publicly accessible datasets of liquid water [Proc. Natl. Acad. Sci. U.S.A. 2016, 113, 8368-8373 and Proc. Natl. Acad. Sci. U.S.A. 2019, 116, 1110-1115]. This is achieved by implementing EKF in TensorFlow. It is found that NNPs trained with EKF are more transferable and less sensitive to the value of the learning rate, as compared to Adam. In both cases, error metrics of the validation set do not always serve as a good indicator for the actual performance of NNPs. Instead, we show that their performance correlates well with a Fisher information based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAdam