Do Machine-Learning Atomic Descriptors and Order Parameters Tell the Same Story? The Case of Liquid Water
Edward Danquah Donkor, Alessandro Laio, Ali Hassanali

TL;DR
This study compares machine-learning atomic descriptors, specifically SOAP, with traditional order parameters in characterizing liquid water, revealing they provide different information and are not directly interchangeable.
Contribution
The paper introduces a statistical approach to assess the information content of SOAP descriptors versus standard order parameters in liquid water analysis.
Findings
SOAP descriptors and order parameters are not equivalent in information content.
SOAP environments are only approximately predicted by order parameters.
Metaparameters in SOAP influence the encoding of chemical information.
Abstract
Machine-learning (ML) has become a key workhorse in molecular simulations. Building an ML model in this context, involves encoding the information of chemical environments using local atomic descriptors. In this work, we focus on the Smooth Overlap of Atomic Positions (SOAP) and their application in studying the properties of liquid water both in the bulk and at the hydrophobic air-water interface. By using a statistical test aimed at assessing the relative information content of different distance measures defined on the same data space, we investigate if these descriptors provide the same information as some of the common order parameters that are used to characterize local water structure such as hydrogen bonding, density or tetrahedrality to name a few. Our analysis suggests that the ML description and the standard order parameters of local water structure are not equivalent. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Spectroscopy and Quantum Chemical Studies
