NbBench: Benchmarking Language Models for Comprehensive Nanobody Tasks
Yiming Zhang, Koji Tsuda

TL;DR
NbBench is a comprehensive benchmark suite designed to evaluate and compare various language models on diverse nanobody-related tasks, addressing a key gap in nanobody modeling research.
Contribution
The paper introduces NbBench, the first unified benchmark for nanobody representation learning across multiple biologically relevant tasks and datasets.
Findings
Antibody language models perform well on antigen-related tasks.
Performance on thermostability and affinity prediction remains challenging.
No single model outperforms others across all tasks.
Abstract
Nanobodies -- single-domain antibody fragments derived from camelid heavy-chain-only antibodies -- exhibit unique advantages such as compact size, high stability, and strong binding affinity, making them valuable tools in therapeutics and diagnostics. While recent advances in pretrained protein and antibody language models (PPLMs and PALMs) have greatly enhanced biomolecular understanding, nanobody-specific modeling remains underexplored and lacks a unified benchmark. To address this gap, we introduce NbBench, the first comprehensive benchmark suite for nanobody representation learning. Spanning eight biologically meaningful tasks across nine curated datasets, NbBench encompasses structure annotation, binding prediction, and developability assessment. We systematically evaluate eleven representative models -- including general-purpose protein LMs, antibody-specific LMs, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMonoclonal and Polyclonal Antibodies Research · vaccines and immunoinformatics approaches · Biochemical and Structural Characterization
