Mask prior-guided denoising diffusion improves inverse protein folding
Peizhen Bai, Filip Miljkovi\'c, Xianyuan Liu, Leonardo De Maria, Rebecca Croasdale-Wood, Owen Rackham, Haiping Lu

TL;DR
This paper introduces MapDiff, a novel diffusion-based framework that improves inverse protein folding by effectively predicting uncertain regions and capturing structural details, leading to superior sequence design performance.
Contribution
MapDiff is the first discrete diffusion model for inverse protein folding that integrates structural information and residue interactions through a graph-based denoising network with mask-prior pre-training.
Findings
Outperforms state-of-the-art methods on four benchmarks.
Generated sequences closely match native protein characteristics.
Effectively predicts high-uncertainty disordered regions.
Abstract
Inverse protein folding generates valid amino acid sequences that can fold into a desired protein structure, with recent deep-learning advances showing strong potential and competitive performance. However, challenges remain, such as predicting elements with high structural uncertainty, including disordered regions. To tackle such low-confidence residue prediction, we propose a Mask-prior-guided denoising Diffusion (MapDiff) framework that accurately captures both structural information and residue interactions for inverse protein folding. MapDiff is a discrete diffusion probabilistic model that iteratively generates amino acid sequences with reduced noise, conditioned on a given protein backbone. To incorporate structural information and residue interactions, we develop a graph-based denoising network with a mask-prior pre-training strategy. Moreover, in the generative process, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · RNA Research and Splicing · Caveolin-1 and cellular processes
MethodsDropout · Diffusion
