Size-and-Shape Space Gaussian Mixture Models for Structural Clustering of Molecular Dynamics Trajectories
Heidi Klem, Glen M. Hocky, and Martin McCullagh

TL;DR
This paper introduces shape-GMM, a novel clustering method for molecular dynamics trajectories that uses a weighted alignment and Gaussian mixture models to identify structural states with high accuracy, even in complex cases.
Contribution
The paper presents shape-GMM, a new approach combining weighted maximum likelihood alignment with GMMs to improve structural clustering of molecular configurations.
Findings
Successfully distinguishes structures indistinguishable by RMSD.
Supports a 4-state folding/unfolding mechanism in protein simulations.
Achieves kinetic detail comparable to state-of-the-art methods.
Abstract
Determining the optimal number and identity of structural clusters from an ensemble of molecular configurations continues to be a challenge. Recent structural clustering methods have focused on the use of internal coordinates due to the innate rotational and translational invariance of these features. The vast number of possible internal coordinates necessitates a feature space supervision step to make clustering tractable, but yields a protocol that can be system type specific. Particle positions offer an appealing alternative to internal coordinates, but suffer from a lack of rotational and translational invariance, as well as a perceived insensitivity to regions of structural dissimilarity. Here, we present a method, denoted shape-GMM, that overcomes the shortcomings of particle positions using a weighted maximum likelihood (ML) alignment procedure. This alignment strategy is then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Enzyme Structure and Function
