Graph Denoising Diffusion for Inverse Protein Folding
Kai Yi, Bingxin Zhou, Yiqing Shen, Pietro Li\`o, Yu Guang Wang

TL;DR
This paper introduces a graph denoising diffusion model for inverse protein folding that effectively generates diverse amino acid sequences conditioned on a given backbone, outperforming existing methods in sequence recovery.
Contribution
The paper presents a novel diffusion probabilistic model that incorporates biological priors and graph structures to improve diversity and accuracy in inverse protein folding.
Findings
Achieves state-of-the-art sequence recovery performance.
Generates diverse protein sequences for fixed backbones.
Utilizes amino acid replacement matrices to encode biological priors.
Abstract
Inverse protein folding is challenging due to its inherent one-to-many mapping characteristic, where numerous possible amino acid sequences can fold into a single, identical protein backbone. This task involves not only identifying viable sequences but also representing the sheer diversity of potential solutions. However, existing discriminative models, such as transformer-based auto-regressive models, struggle to encapsulate the diverse range of plausible solutions. In contrast, diffusion probabilistic models, as an emerging genre of generative approaches, offer the potential to generate a diverse set of sequence candidates for determined protein backbones. We propose a novel graph denoising diffusion model for inverse protein folding, where a given protein backbone guides the diffusion process on the corresponding amino acid residue types. The model infers the joint distribution of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsProtein Structure and Dynamics · Machine Learning in Bioinformatics · RNA and protein synthesis mechanisms
MethodsDiffusion
