Deep Signature: Characterization of Large-Scale Molecular Dynamics
Tiexin Qin, Mengxu Zhu, Chunyang Li, Terry Lyons, Hong Yan, Haoliang Li

TL;DR
Deep Signature introduces a new framework for analyzing complex protein dynamics using trajectory-based signatures, spectral clustering, and invariance properties, enabling better understanding of molecular interactions.
Contribution
It is the first to apply trajectory-based signature transforms combined with spectral clustering to characterize large-scale molecular dynamics.
Findings
Outperforms baseline methods on biological benchmarks.
Provides invariance to translation, rotation, and permutation.
Demonstrates theoretical properties ensuring robustness.
Abstract
Understanding protein dynamics are essential for deciphering protein functional mechanisms and developing molecular therapies. However, the complex high-dimensional dynamics and interatomic interactions of biological processes pose significant challenge for existing computational techniques. In this paper, we approach this problem for the first time by introducing Deep Signature, a novel computationally tractable framework that characterizes complex dynamics and interatomic interactions based on their evolving trajectories. Specifically, our approach incorporates soft spectral clustering that locally aggregates cooperative dynamics to reduce the size of the system, as well as signature transform that collects iterated integrals to provide a global characterization of the non-smooth interactive dynamics. Theoretical analysis demonstrates that Deep Signature exhibits several desirable…
Peer Reviews
Decision·ICLR 2025 Poster
1. Incorporates appropriate symmetries for (temporal) 3D point cloud learning. In particular, the invariance to time reparameterization is key because the underlying MD simulations might be prone to random restarts and miscellaneous artefacts preventing them from being a smooth "video" (ie, the MD itself might be erratic with the same state sampled repeatedly). 2. Deep Signature _learns_ the ideal CG beads relevant to the task, bypassing the need to manually remove degrees of freedom (eg: CA-le
The paper does not clarify the level of coarsening necessary to make path signature computations feasible. For instance, in gene regulatory dynamics, the graph was reduced from 100 nodes to 30, while the EGFR dataset was reduced from approximately 5000 atoms to 50 nodes, but the level of coarsening for the GPCR dataset is unspecified. This raises questions about whether 50 nodes is a computational limit for the method. Additionally, while protein structure can be intuitively coarsened into backb
1. The presentation of the method is very clear and easy to understand. 2. The motivation of applying signature transform is reasonable.
1. Lack of experiments on large dataset. As the paper claims, the Deep Signature framework can capture large-scale complex dynamics. So I think experiments on datasets with large amount of data and system size are necessary. But the paper only includes the experiments on datasets with large system size. 2. I think the baseline in this paper is too weak. For example, the author should compare the strong baseline with graph transformer architecture[1] based on the first/last frame of the trajecto
1. Authors develop an end-to-end framework to characterize interatomic interactions and dynamics of large-scale molecules, it shows improvement on three benchmarks and provides interpretability. 2. The size of the system is reduced by deep spectral clustering module without any expert knowledge. 3. The framework's desirable properties are supported by theoretical analysis.
1. The authors compare their approach to baseline methods, but a more comprehensive comparison with some SOTA baselines would provide a more robust evaluation. 2. The manuscript would benefit from a comparison of deep spectral clustering module with existing coarse graining methods. 3. An analysis of the model’s sensitivity to hyperparameters would provide insights into its robustness and reproducibility.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management
MethodsSpectral Clustering
