Improving Antibody Design with Force-Guided Sampling in Diffusion Models
Paulina Kulyt\.e, Francisco Vargas, Simon Valentin Mathis, Yu Guang, Wang, Jos\'e Miguel Hern\'andez-Lobato, Pietro Li\`o

TL;DR
This paper introduces DiffForce, a novel diffusion model that incorporates force-field energy feedback to improve antibody CDR design, resulting in more accurate and energetically favorable antibody structures.
Contribution
The paper presents a new method integrating force-guided sampling into diffusion models for antibody design, addressing dataset limitations and improving structure quality.
Findings
Guides diffusion models to generate lower-energy antibody structures.
Enhances both structural accuracy and sequence quality of designed antibodies.
Demonstrates improved generalization to out-of-distribution interfaces.
Abstract
Antibodies, crucial for immune defense, primarily rely on complementarity-determining regions (CDRs) to bind and neutralize antigens, such as viruses. The design of these CDRs determines the antibody's affinity and specificity towards its target. Generative models, particularly denoising diffusion probabilistic models (DDPMs), have shown potential to advance the structure-based design of CDR regions. However, only a limited dataset of bound antibody-antigen structures is available, and generalization to out-of-distribution interfaces remains a challenge. Physics based force-fields, which approximate atomic interactions, offer a coarse but universal source of information to better mold designs to target interfaces. Integrating this foundational information into diffusion models is, therefore, highly desirable. Here, we propose a novel approach to enhance the sampling process of diffusion…
Peer Reviews
Decision·ICLR 2025 Conference Withdrawn Submission
This paper focuses on important subject - sampling sequence and corresponding structure of complementarity determining regions (CDRs) of antibodies in the context of binding partner (antigen). Improving this process can have important implications in drug discovery. The main original contribution of the paper is DiffForce - an model and algorithm which allows for guidance of the generation process with differentiable implementation of force field thus potentially allowing to skew the samples tow
While I believe that the idea and implementation described in the paper is valuable and has a significant potential I am not convinced by the evaluation. The paper has strong and general claims on "improving antibody design" and offering "enhanced quality of produced antibody sequences" but the results focus mostly on improvements of binding energies (estimated with the orthogonal, non-differentiable Rosetta force field). Sequence recovery and RMSD metrics are valuable metrics too but their inte
The idea of incorporating information of a physics-based force field into ML-based sampling seems very promising, and could lead to substantial improvements in practical applicability of these methods which often generate clashes or physically impossible structures.
The benchmarking is relatively limited, comparing only to DiffAb, a very similar model without the guided sampling, and RAbD, a physics-based method. It would be useful to include comparisons with other recent ML work, such as dyMEAN, HERN, RFdiffusion or IgDiff. In table 1, it would be helpful to include standard deviation for each metric, to understand the statistical significance of these results, especially as numbers are given to 2 or 3 decimal points. Though the incorporation of force fiel
• The paper outlines a potentially promising approach for combining MD simulations with pretrained diffusion models. • The theory seems to be well-motivated, assuming we can estimate s0 and O0 effectively • The improved energy scores show that the addition of MD is indeed resulting in generated CDRs with lower energy. • The sequence recovery is marginally better than that of DiffAb.
• While the energy demonstrations are important, they are not surprising considering the only difference between this and DiffAb should be improved energy minimization. It would also be useful to see plots in aggregate rather than individual structures which can be cherry-picked. • An important part of this work is the estimation of the final residues and orientations, however the equations for these seem to depend on information which should not be available at inference time. The incorpo
The paper is clear and easy to follow.
1. The method is a straightforward application of diffusion with guidance, and the theoretical derivations in the manuscript are simply replications of those from the original diffusion guidance method. 2. The method is compared with only a few baseline approaches on a limited set of tasks, despite many recent advancements in this area.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMonoclonal and Polyclonal Antibodies Research · Protein purification and stability · Viral Infectious Diseases and Gene Expression in Insects
MethodsDiffusion
