LaGDif: Latent Graph Diffusion Model for Efficient Protein Inverse Folding with Self-Ensemble
Taoyu Wu, Yu Guang Wang, Yiqing Shen

TL;DR
LaGDif introduces a continuous latent space diffusion model for protein inverse folding, significantly improving sequence recovery and structural accuracy over existing methods.
Contribution
It proposes a novel latent graph diffusion approach that bridges discrete and continuous data, and introduces a self-ensemble method for more stable and accurate protein sequence prediction.
Findings
Achieves up to 45.55% improvement in sequence recovery rate.
Maintains an average RMSD of 1.96 Å between generated and native structures.
Outperforms existing state-of-the-art techniques on the CATH dataset.
Abstract
Protein inverse folding aims to identify viable amino acid sequences that can fold into given protein structures, enabling the design of novel proteins with desired functions for applications in drug discovery, enzyme engineering, and biomaterial development. Diffusion probabilistic models have emerged as a promising approach in inverse folding, offering both feasible and diverse solutions compared to traditional energy-based methods and more recent protein language models. However, existing diffusion models for protein inverse folding operate in discrete data spaces, necessitating prior distributions for transition matrices and limiting smooth transitions and gradients inherent to continuous spaces, leading to suboptimal performance. Drawing inspiration from the success of diffusion models in continuous domains, we introduce the Latent Graph Diffusion Model for Protein Inverse Folding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Biofuel production and bioconversion
MethodsDiffusion
