Conditional generation of antibody sequences with classifier-guided germline-absorbing discrete diffusion
Justin Sanders, Luca Giancardo, Lan Guo, Yue Zhao, Kemal Sonmez, Nina Cheng, Melih Yilmaz

TL;DR
This paper introduces a novel discrete diffusion model for antibody sequence generation that incorporates germline information and supports classifier-guided conditional sampling, improving biological relevance and property optimization.
Contribution
The authors develop germline absorbing diffusion, a biologically motivated modification that enhances antibody sequence modeling and enables flexible, conditioned generation.
Findings
Germline diffusion improves non-germline residue prediction accuracy from 26% to 46%.
The model outperforms EvoProtGrad in balancing class adherence and sample quality.
Conditional generation effectively produces antibodies with desired properties like hydrophobicity and binding affinity.
Abstract
Antibody therapeutics are among the most successful modern medicines, yet computationally designing antibodies with desirable binding and developability properties remains challenging. While protein language models (pLMs) have emerged as powerful tools for antibody sequence design, existing approaches largely suffer from two key limitations: they predominantly memorize germline sequences rather than modeling biologically meaningful somatic variation, and they offer limited support for flexible classifier-guided conditional generation. We address these challenges through two primary contributions. First, we demonstrate that discrete diffusion fine-tuning achieves strong language modeling performance on antibody sequences while allowing for generation conditioned on any off-the-shelf classifier. Second, we introduce germline absorbing diffusion, a novel modification of the discrete…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
