# Training neural nets to learn reactive potential energy surfaces using   interactive quantum chemistry in virtual reality

**Authors:** Silvia Amabilino, Lars A. Bratholm, Simon J. Bennie, Alain C. Vaucher,, Markus Reiher, and David R. Glowacki

arXiv: 1901.05417 · 2019-05-24

## TL;DR

This paper presents a GPU-accelerated neural network framework trained on data from interactive virtual reality molecular dynamics to accurately model reactive potential energy surfaces, improving sampling efficiency along reaction pathways.

## Contribution

It introduces a novel use of real-time interactive quantum chemistry in virtual reality for sampling training data for neural networks learning reactive PESs.

## Key findings

- iMD-VR sampling improves near-MEP data coverage.
- Neural networks trained on iMD-VR data predict energies well near the MEP.
- Training data quality influences neural network accuracy for off-path structures.

## Abstract

Whilst the primary bottleneck to a number of computational workflows was not so long ago limited by processing power, the rise of machine learning technologies has resulted in a paradigm shift which places increasing value on issues related to data curation - i.e., data size, quality, bias, format, and coverage. Increasingly, data-related issues are equally as important as the algorithmic methods used to process and learn from the data. Here we introduce an open source GPU-accelerated neural network (NN) framework for learning reactive potential energy surfaces (PESs), and investigate the use of real-time interactive ab initio molecular dynamics in virtual reality (iMD-VR) as a new strategy for rapidly sampling geometries along reaction pathways which can be used to train NNs to learn reactive PESs. Focussing on hydrogen abstraction reactions of CN radical with isopentane, we compare the performance of NNs trained using iMD-VR data versus NNs trained using a more traditional method, namely molecular dynamics (MD) constrained to sample a predefined grid of points along hydrogen abstraction reaction coordinates. Both the NN trained using iMD-VR data and the NN trained using the constrained MD data reproduce important qualitative features of the reactive PESs, such as a low and early barrier to abstraction. Quantitatively, learning is sensitive to the training dataset. Our results show that user-sampled structures obtained with the quantum chemical iMD-VR machinery enable better sampling in the vicinity of the minimum energy path (MEP). As a result, the NN trained on the iMD-VR data does very well predicting energies in the vicinity of the MEP, but less well predicting energies for 'off-path' structures. The NN trained on the constrained MD data does better in predicting energies for 'off-path' structures, given that it included a number of such structures in its training set.

---
Source: https://tomesphere.com/paper/1901.05417