Information-theoretical measures identify accurate low-resolution representations of protein configurational space
Margherita Mele, Roberto Covino, Raffaello Potestio

TL;DR
This paper introduces an information-theoretical framework to evaluate and optimize clustering methods for simplifying protein conformational space in molecular dynamics simulations, revealing that ultrametric clustering best captures the self-similar organization of protein landscapes.
Contribution
It presents a novel application of information theory to assess and identify optimal low-resolution representations of protein conformational space, emphasizing ultrametric clustering.
Findings
Ultrametric clustering best captures the protein conformational landscape.
The framework balances simplicity and informativeness in data reduction.
The approach is applicable beyond biophysics to large datasets in various fields.
Abstract
A steadily growing computational power is employed to perform molecular dynamics simulations of biological macromolecules, which represents at the same time an immense opportunity and a formidable challenge. In fact, large amounts of data are produced, from which useful, synthetic, and intelligible information has to be extracted to make the crucial step from knowing to understanding. Here we tackled the problem of coarsening the conformational space sampled by proteins in the course of molecular dynamics simulations. We applied different schemes to cluster the frames of a dataset of protein simulations; we then employed an information-theoretical framework, based on the notion of resolution and relevance, to gauge how well the various clustering methods accomplish this simplification of the configurational space. Our approach allowed us to identify the level of resolution that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetabolomics and Mass Spectrometry Studies · Gut microbiota and health · Cell Image Analysis Techniques
