Towards a Universal Foundation Model for Protein Dynamics: A Multi-Chain Tree-Structured Framework with Transformer Propagators
Jinzhen Zhu

TL;DR
This paper introduces a universal, transformer-based framework for simulating protein dynamics efficiently, achieving high accuracy and generalization across diverse protein systems with significant speedup over traditional methods.
Contribution
The authors develop a hierarchical, multi-chain tree-structured representation combined with a transformer propagator, enabling fast, accurate, and generalizable protein dynamics simulations.
Findings
Achieves over 10,000x speedup compared to traditional MD.
Maintains statistical consistency with all-atom MD in RMSD profiles.
Successfully generalizes to multi-chain protein assemblies.
Abstract
Simulating large-scale protein dynamics using traditional all-atom molecular dynamics (MD) remains computationally prohibitive. We present a unified, universal framework for coarse-grained molecular dynamics (CG-MD) that achieves high-fidelity structural reconstruction and generalizes across diverse protein systems. Central to our approach is a hierarchical, tree-structured protein representation (TSCG) that maps Cartesian coordinates into a minimal set of interpretable collective variables. We extend this representation to accommodate multi-chain assemblies, demonstrating sub-angstrom precision in reconstructing full-atom structures from coarse-grained nodes. To model temporal evolution, we formulate protein dynamics as stochastic differential equations (SDEs), utilizing a Transformer-based architecture as a universal propagator. By representing collective variables as language-like…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
