Deciphering antibody affinity maturation with language models and weakly supervised learning
Jeffrey A. Ruffolo, Jeffrey J. Gray, Jeremias Sulam

TL;DR
This paper introduces AntiBERTy, a language model trained on antibody sequences, which helps understand immune repertoires and affinity maturation, potentially aiding in therapeutic antibody discovery.
Contribution
The study presents AntiBERTy, a novel antibody-specific language model trained on 558 million sequences, and demonstrates its ability to cluster antibodies and identify key binding residues.
Findings
AntiBERTy clusters antibodies into affinity maturation trajectories.
Models trained with multiple instance learning identify key binding residues.
Potential to infer antigen binding from repertoire sequences alone.
Abstract
In response to pathogens, the adaptive immune system generates specific antibodies that bind and neutralize foreign antigens. Understanding the composition of an individual's immune repertoire can provide insights into this process and reveal potential therapeutic antibodies. In this work, we explore the application of antibody-specific language models to aid understanding of immune repertoires. We introduce AntiBERTy, a language model trained on 558M natural antibody sequences. We find that within repertoires, our model clusters antibodies into trajectories resembling affinity maturation. Importantly, we show that models trained to predict highly redundant sequences under a multiple instance learning framework identify key binding residues in the process. With further development, the methods presented here will provide new insights into antigen binding from repertoire sequences alone.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMonoclonal and Polyclonal Antibodies Research · vaccines and immunoinformatics approaches · RNA and protein synthesis mechanisms
