Embedded-physics machine learning for coarse-graining and collective variable discovery without data
Markus Sch\"oberl, Nicholas Zabaras, Phaedon-Stelios Koutsourelakis

TL;DR
This paper introduces a physics-embedded machine learning framework for coarse-graining in molecular dynamics that does not require large datasets, enabling efficient surrogate modeling and discovery of meaningful collective variables.
Contribution
The authors develop a reverse KL divergence-based learning method that embeds physics directly, avoiding extensive MD simulations and enabling discovery of physically meaningful collective variables.
Findings
Effective surrogate model for atomistic configurations
Discovered collective variables relate to physicochemical properties
Improved predictive ability over traditional methods
Abstract
We present a novel learning framework that consistently embeds underlying physics while bypassing a significant drawback of most modern, data-driven coarse-grained approaches in the context of molecular dynamics (MD), i.e., the availability of big data. The generation of a sufficiently large training dataset poses a computationally demanding task, while complete coverage of the atomistic configuration space is not guaranteed. As a result, the explorative capabilities of data-driven coarse-grained models are limited and may yield biased "predictive" tools. We propose a novel objective based on reverse Kullback-Leibler divergence that fully incorporates the available physics in the form of the atomistic force field. Rather than separating model learning from the data-generation procedure - the latter relies on simulating atomistic motions governed by force fields - we query the atomistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Machine Learning in Materials Science · Protein Structure and Dynamics
