# ALLM-Ab: Active Learning-Driven Antibody Optimization Using Fine-Tuned Protein Language Models

**Authors:** Kairi Furui, Masahito Ohue

PMC · DOI: 10.1021/acs.jcim.5c01577 · Journal of Chemical Information and Modeling · 2025-10-22

## TL;DR

ALLM-Ab is a new method that uses advanced AI to optimize antibodies for better performance and developability in drug design.

## Contribution

Introduces ALLM-Ab, an active learning framework using fine-tuned protein language models for efficient antibody optimization.

## Key findings

- ALLM-Ab outperforms baseline methods in discovering high-affinity antibody variants.
- The framework preserves critical developability metrics during optimization.
- Validated using DMS data and active learning trials across 15 antigens.

## Abstract

Antibody engineering requires a delicate balance between
enhancing
binding affinity and maintaining developability properties. In this
study, we present ALLM-Ab (Active Learning with Language Models for
Antibodies), a novel active learning framework that leverages fine-tuned
protein language models to accelerate antibody sequence optimization.
By employing parameter-efficient fine-tuning via low-rank adaptation,
coupled with a learning-to-rank strategy, ALLM-Ab accurately assesses
mutant fitness while efficiently generating candidate sequences through
direct sampling from the model’s probability distribution.
Furthermore, by integrating a multiobjective optimization scheme incorporating
antibody developability metrics, the framework ensures that optimized
sequences retain therapeutic antibody-like properties alongside improved
binding affinity. We validate ALLM-Ab in both offline experiments
using deep mutational scanning (DMS) data from the BindingGYM data
set and online active learning trials targeting Flex ddG energy minimization
across 15 antigens. Results demonstrate that ALLM-Ab not only expedites
the discovery of high-affinity variants compared to baseline Gaussian
process regression and genetic algorithm-based approaches, but also
preserves critical antibody developability metrics. This work lays
the foundation for more efficient and reliable antibody design strategies,
with the potential to significantly reduce therapeutic development
costs.

## Full-text entities

- **Genes:** VEGFA (vascular endothelial growth factor A) [NCBI Gene 7422] {aka L-VEGF, MVCD1, VEGF, VPF}, ERBB2 (erb-b2 receptor tyrosine kinase 2) [NCBI Gene 2064] {aka CD340, HER-2, HER-2/neu, HER2, MLN 19, MLN-19}, ANGPT2 (angiopoietin 2) [NCBI Gene 285] {aka AGPT2, ANG2, LMPHM10}, SMOC1 (SPARC related modular calcium binding 1) [NCBI Gene 64093] {aka OAS}
- **Diseases:** IDDM (MESH:D003922), Ab (MESH:D000089965), pLMs (MESH:D007806)
- **Chemicals:** APH (MESH:C054801), IP (MESH:C041508), amino acids (MESH:D000596), phenylalanine (MESH:D010649), AP (MESH:D000667), cysteine (MESH:D003545), ALLM (MESH:C069861), 5A12 (-), methionine (MESH:D008715)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12606632/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12606632/full.md

## References

76 references — full list in the complete paper: https://tomesphere.com/paper/PMC12606632/full.md

---
Source: https://tomesphere.com/paper/PMC12606632