Best practices for machine learning in antibody discovery and development
Leonard Wossnig, Norbert Furtmann, Andrew Buchanan, Sandeep Kumar, and, Victor Greiff

TL;DR
This paper reviews current ML practices in antibody discovery, identifies challenges, and proposes standardized guidelines to improve reproducibility and accelerate therapeutic antibody development.
Contribution
It provides a comprehensive critique of existing ML methods in antibody D&D and offers a set of best practices and evaluation guidelines for the field.
Findings
Highlights diversity in datasets and evaluation metrics hampers comparison
Identifies common pitfalls in ML-based antibody discovery
Recommends standardized protocols for ML model development and assessment
Abstract
Over the past 40 years, the discovery and development of therapeutic antibodies to treat disease has become common practice. However, as therapeutic antibody constructs are becoming more sophisticated (e.g., multi-specifics), conventional approaches to optimisation are increasingly inefficient. Machine learning (ML) promises to open up an in silico route to antibody discovery and help accelerate the development of drug products using a reduced number of experiments and hence cost. Over the past few years, we have observed rapid developments in the field of ML-guided antibody discovery and development (D&D). However, many of the results are difficult to compare or hard to assess for utility by other experts in the field due to the high diversity in the datasets and evaluation techniques and metrics that are across industry and academia. This limitation of the literature curtails the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiosimilars and Bioanalytical Methods · Monoclonal and Polyclonal Antibodies Research · vaccines and immunoinformatics approaches
MethodsSparse Evolutionary Training
