Exploring Protein Language Model Architecture-Induced Biases for Antibody Comprehension

Mengren (Bill) Liu; Yixiang Zhang; Yiming (Jason) Zhang

arXiv:2512.09894·cs.LG·December 11, 2025

Exploring Protein Language Model Architecture-Induced Biases for Antibody Comprehension

Mengren (Bill) Liu, Yixiang Zhang, Yiming (Jason) Zhang

PDF

Open Access

TL;DR

This study examines how different protein language model architectures influence their ability to understand antibody-specific features, revealing biases and attention patterns that impact antibody sequence comprehension.

Contribution

It systematically compares architecture-induced biases in antibody-focused PLMs and general models, highlighting their different capabilities in capturing biological features.

Findings

01

Antibody-specific models focus on CDR regions naturally.

02

General models benefit from CDR-focused training strategies.

03

Distinct biases in biological feature capture are linked to model architecture.

Abstract

Recent advances in protein language models (PLMs) have demonstrated remarkable capabilities in understanding protein sequences. However, the extent to which different model architectures capture antibody-specific biological properties remains unexplored. In this work, we systematically investigate how architectural choices in PLMs influence their ability to comprehend antibody sequence characteristics and functions. We evaluate three state-of-the-art PLMs-AntiBERTa, BioBERT, and ESM2--against a general-purpose language model (GPT-2) baseline on antibody target specificity prediction tasks. Our results demonstrate that while all PLMs achieve high classification accuracy, they exhibit distinct biases in capturing biological features such as V gene usage, somatic hypermutation patterns, and isotype information. Through attention attribution analysis, we show that antibody-specific models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMonoclonal and Polyclonal Antibodies Research · vaccines and immunoinformatics approaches · Biomedical Text Mining and Ontologies