Learning Geometrically Disentangled Representations of Protein Folding Simulations
N. Joseph Tatro, Payel Das, Pin-Yu Chen, Vijil Chenthamarakshan,, Rongjie Lai

TL;DR
This paper introduces ProGAE, a geometric autoencoder that learns disentangled representations of protein structures from molecular simulations, enabling accurate, interpretable, and transferable generation of protein conformations.
Contribution
The work presents a novel geometric autoencoder framework that separates intrinsic and extrinsic protein geometries, improving interpretability and transferability in protein structure modeling.
Findings
ProGAE accurately reconstructs protein conformations at near-experimental resolution.
The model efficiently generates diverse structural variations of proteins.
ProGAE's transferability reduces retraining needs across different protein states.
Abstract
Massive molecular simulations of drug-target proteins have been used as a tool to understand disease mechanism and develop therapeutics. This work focuses on learning a generative neural network on a structural ensemble of a drug-target protein, e.g. SARS-CoV-2 Spike protein, obtained from computationally expensive molecular simulations. Model tasks involve characterizing the distinct structural fluctuations of the protein bound to various drug molecules, as well as efficient generation of protein conformations that can serve as an complement of a molecular simulation engine. Specifically, we present a geometric autoencoder framework to learn separate latent space encodings of the intrinsic and extrinsic geometries of the protein structure. For this purpose, the proposed Protein Geometric AutoEncoder (ProGAE) model is trained on the protein contact map and the orientation of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Cell Image Analysis Techniques · Genetics, Bioinformatics, and Biomedical Research
