Auto-WHATMD : Automated Wasserstein-based High-dimensional feature extraction Analysis of Trajectories from Molecular Dynamics
Sosuke Asano, Ikki Yasuda, Katsuhiro Endo, Yoshinori Hirano, Kenji Yasuoka

TL;DR
Auto-WHATMD is an automated method that uses Wasserstein distance and simulated annealing to identify key residues in high-dimensional molecular dynamics trajectories, aiding comparison and analysis of protein systems.
Contribution
The paper introduces auto-WHATMD, a novel automated approach for feature extraction from high-dimensional molecular dynamics data using optimal transport and simulated annealing.
Findings
Successfully identified discriminative residues in bromodomain 4.
Few residues captured ligand-binding affinity correlations.
Automated method reduces reliance on domain expertise.
Abstract
Comparing multiple protein systems with variation such as different binding ligands or mutations, and understanding their effects is one of the objectives in molecular dynamics simulations. Representation of these systems by a few features enables quantitative comparison. However, because molecular dynamics simulation trajectories are high-dimensional spatiotemporal data, selection of key features relies on domain expertise, sometimes introducing arbitrary assumptions. Here, we present an approach that uses the optimal transport distance to compare high-dimensional trajectory data, and employs simulated annealing to identify the residues that best distinguish multiple systems. We term this algorithm auto-WHATMD (automated Wasserstein-based High-dimensional feature extraction Analysis for Trajectories of Molecular Dynamics). We applied auto-WHATMD to multiple protein-ligand systems of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Degradation and Inhibitors · Protein Structure and Dynamics · Genomics and Chromatin Dynamics
