Graph Denoising Diffusion for Inverse Protein Folding

Kai Yi; Bingxin Zhou; Yiqing Shen; Pietro Li\`o; Yu Guang Wang

arXiv:2306.16819·q-bio.QM·November 8, 2023·20 cites

Graph Denoising Diffusion for Inverse Protein Folding

Kai Yi, Bingxin Zhou, Yiqing Shen, Pietro Li\`o, Yu Guang Wang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a graph denoising diffusion model for inverse protein folding that effectively generates diverse amino acid sequences conditioned on a given backbone, outperforming existing methods in sequence recovery.

Contribution

The paper presents a novel diffusion probabilistic model that incorporates biological priors and graph structures to improve diversity and accuracy in inverse protein folding.

Findings

01

Achieves state-of-the-art sequence recovery performance.

02

Generates diverse protein sequences for fixed backbones.

03

Utilizes amino acid replacement matrices to encode biological priors.

Abstract

Inverse protein folding is challenging due to its inherent one-to-many mapping characteristic, where numerous possible amino acid sequences can fold into a single, identical protein backbone. This task involves not only identifying viable sequences but also representing the sheer diversity of potential solutions. However, existing discriminative models, such as transformer-based auto-regressive models, struggle to encapsulate the diverse range of plausible solutions. In contrast, diffusion probabilistic models, as an emerging genre of generative approaches, offer the potential to generate a diverse set of sequence candidates for determined protein backbones. We propose a novel graph denoising diffusion model for inverse protein folding, where a given protein backbone guides the diffusion process on the corresponding amino acid residue types. The model infers the joint distribution of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ykiiiiii/grade_if
pytorchOfficial

Videos

Graph Denoising Diffusion for Inverse Protein Folding· slideslive

Taxonomy

TopicsProtein Structure and Dynamics · Machine Learning in Bioinformatics · RNA and protein synthesis mechanisms

MethodsDiffusion