All-atom inverse protein folding through discrete flow matching
Kai Yi, Kiarash Jamali, Sjors H. W. Scheres

TL;DR
ADFLIP is a novel all-atom inverse protein folding model that uses discrete flow matching to design sequences conditioned on structural context, excelling in complex and dynamic protein structures.
Contribution
This work introduces ADFLIP, a new generative model for inverse protein folding that incorporates all-atom structural context and ensemble sampling, outperforming previous methods.
Findings
Achieves state-of-the-art performance in inverse folding tasks.
Effectively designs sequences for dynamic and multi-state protein complexes.
Incorporates pre-trained models for property-guided sequence optimization.
Abstract
The recent breakthrough of AlphaFold3 in modeling complex biomolecular interactions, including those between proteins and ligands, nucleotides, or metal ions, creates new opportunities for protein design. In so-called inverse protein folding, the objective is to find a sequence of amino acids that adopts a target protein structure. Many inverse folding methods struggle to predict sequences for complexes that contain non-protein components, and perform poorly with complexes that adopt multiple structural states. To address these challenges, we present ADFLIP (All-atom Discrete FLow matching Inverse Protein folding), a generative model based on discrete flow-matching for designing protein sequences conditioned on all-atom structural contexts. ADFLIP progressively incorporates predicted amino acid side chains as structural context during sequence generation and enables the design of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
