Exploring Protein Language Model Architecture-Induced Biases for Antibody Comprehension
Mengren (Bill) Liu, Yixiang Zhang, Yiming (Jason) Zhang

TL;DR
This study examines how different protein language model architectures influence their ability to understand antibody-specific features, revealing biases and attention patterns that impact antibody sequence comprehension.
Contribution
It systematically compares architecture-induced biases in antibody-focused PLMs and general models, highlighting their different capabilities in capturing biological features.
Findings
Antibody-specific models focus on CDR regions naturally.
General models benefit from CDR-focused training strategies.
Distinct biases in biological feature capture are linked to model architecture.
Abstract
Recent advances in protein language models (PLMs) have demonstrated remarkable capabilities in understanding protein sequences. However, the extent to which different model architectures capture antibody-specific biological properties remains unexplored. In this work, we systematically investigate how architectural choices in PLMs influence their ability to comprehend antibody sequence characteristics and functions. We evaluate three state-of-the-art PLMs-AntiBERTa, BioBERT, and ESM2--against a general-purpose language model (GPT-2) baseline on antibody target specificity prediction tasks. Our results demonstrate that while all PLMs achieve high classification accuracy, they exhibit distinct biases in capturing biological features such as V gene usage, somatic hypermutation patterns, and isotype information. Through attention attribution analysis, we show that antibody-specific models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMonoclonal and Polyclonal Antibodies Research · vaccines and immunoinformatics approaches · Biomedical Text Mining and Ontologies
