AlphaFold Distillation for Protein Design
Igor Melnyk, Aurelie Lozano, Payel Das, Vijil Chenthamarakshan

TL;DR
This paper introduces a knowledge distillation approach to create a faster, differentiable protein folding model based on AlphaFold, improving inverse protein design by enhancing sequence recovery and diversity.
Contribution
The authors develop a distilled, efficient folding model using confidence metrics from AlphaFold, enabling its integration into inverse protein design workflows.
Findings
Up to 3% improvement in sequence recovery.
Up to 45% increase in protein diversity.
Maintains structural consistency in generated sequences.
Abstract
Inverse protein folding, the process of designing sequences that fold into a specific 3D structure, is crucial in bio-engineering and drug discovery. Traditional methods rely on experimentally resolved structures, but these cover only a small fraction of protein sequences. Forward folding models like AlphaFold offer a potential solution by accurately predicting structures from sequences. However, these models are too slow for integration into the optimization loop of inverse folding models during training. To address this, we propose using knowledge distillation on folding model confidence metrics, such as pTM or pLDDT scores, to create a faster and end-to-end differentiable distilled model. This model can then be used as a structure consistency regularizer in training the inverse folding model. Our technique is versatile and can be applied to other design tasks, such as sequence-based…
Peer Reviews
Decision·Submitted to ICLR 2024
- Utilize distillation method for transferring AlphaFold's knowledge into fast, differentiable SC scores. - Implement AFDistill for cost-effective integration of AlphaFold expertise into design models. - Conduct comprehensive experiments.
- I am unclear about the motivation behind this paper, particularly regarding the decision to utilize (distilled) AlphaFold instead of directly using AFDB. For instance, the paper states, "Despite this success, large-scale training is computationally expensive. A more efficient method could be to use a pre-trained forward folding model to guide the training of the inverse folding model." However, I fail to see the efficiency benefits of this approach, as utilizing the AF model (or distilled AF m
1. The insight to distill AlphaFold2 for efficient inference in protein-related tasks is well-motivated. 2. Integration of structure consistency in inverse protein folding is important and should intuitively lead to better performance.
1. One major concern is the lack of technical novelty. AFDistill is based on existing ProtBERT without major modification of the model. 2. Though experimental results show significant gain in diversity of predicted amino acid sequences, the improvements on other metrics (e.g., recovery and perplexity) are trivial.
S1. Novel idea of distilling AlphaFold into a fast and differentiable model (AFDistill) for structural consistency prediction. S2. Elevation in recovery rate is observed though marginal.
W0. Given the marginal improvements, it is unconvincing that using plDDTs as loss (L_sc) for sequence design is really useful or not. W1. The extra compute resources costs are significant in this plan, including both the distillation and the extra cost in evaluating / backpropagating L_sc. No justification of these extra computational costs is presented. W2. A generated sequence with high plDDT generally means that it is *conservative* (easy to be predicted), but this is not an indicator that
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Machine Learning in Bioinformatics · Enzyme Structure and Function
MethodsAlphaFold · Knowledge Distillation
