Data-driven Uncertainty Quantification for Systematic Coarse-grained Models
Tangxin Jin, Anthony Chazirakis, Evangelia Kalligiannaki, Vagelis, Harmandaris, Markos A. Katsoulakis

TL;DR
This paper introduces statistical methods to quantify confidence in coarse-grained models of molecular systems, addressing errors from limited data and demonstrating applications on diffusion processes and polyethylene melts.
Contribution
It develops and applies non-asymptotic statistical techniques for uncertainty quantification in coarse-graining, enhancing model reliability assessment.
Findings
Confidence sets can be estimated using bootstrap and jackknife methods.
Asymptotic confidence intervals are feasible with sufficient sampling.
Uncertainty quantification improves model validation for polymer systems.
Abstract
In this work, we present methodologies for the quantification of confidence in bottom-up coarse-grained models for molecular and macromolecular systems. Coarse-graining methods have been extensively used in the past decades in order to extend the length and time scales accessible by simulation methodologies. The quantification, though, of induced errors due to the limited availability of fine-grained data is not yet established. Here, we employ rigorous statistical methods to deduce guarantees for the optimal coarse models obtained via approximations of the multi-body potential of mean force, with the relative entropy, the relative entropy rate minimization, and the force matching methods. Specifically, we present and apply statistical approaches, such as bootstrap and jackknife, to infer confidence sets for a limited number of samples, i.e., molecular configurations. Moreover, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbabilistic and Robust Engineering Design · Markov Chains and Monte Carlo Methods · Protein Structure and Dynamics
