# ImmUQBench: a benchmark on uncertainty quantification of protein immunogenicity prediction

**Authors:** Alif Bin Abdul Qayyum, Amir Hossein Rahmati, Xiaoning Qian, Byung-Jun Yoon

PMC · DOI: 10.1093/oxfimm/iqag003 · Oxford Open Immunology · 2026-03-03

## TL;DR

This paper introduces ImmUQBench, a benchmark for evaluating uncertainty quantification methods in predicting protein immunogenicity, aiming to improve the reliability of AI/ML models in therapeutic antigen design.

## Contribution

The novel contribution is the creation of ImmUQBench, a standardized benchmark for assessing UQ methods in immunogenicity prediction under data-scarce conditions.

## Key findings

- Different UQ strategies show varying abilities in capturing predictive uncertainty and maintaining robustness.
- ImmUQBench provides a standardized evaluation framework for model accuracy, calibration, and robustness.
- The benchmark helps identify the most trustworthy UQ methods for immunogenicity prediction.

## Abstract

Discovering antigen proteins, capable of eliciting desired immune responses, is of paramount importance in developing immunogenic therapeutics for combating various diseases, particularly autoimmune disorders, infectious diseases, as well as cancers. Despite recent advances in artificial intelligence (AI) and machine learning (ML), accurate and generalizable immunogenicity prediction remains challenging due to limited labeled data and model over-simplifications. Uncertainty quantification (UQ) approaches are commonly used to address the aforementioned challenges when applying AI/ML methods with limited training data, aiming to reduce the risk of catastrophic errors. This study aims to systematically evaluate the performance of UQ methods for antigen immunogenicity prediction and to establish a benchmark for assessing model reliability in data-scarce setting. We here present ImmUQBench, a comprehensive benchmark that compares several well-known UQ methods for antigen immunogenicity prediction tasks. The benchmark assesses models in terms of predictive accuracy, calibration, and robustness under both in and out of distribution settings, providing standardized evaluation framework. Our evaluation reveals that different UQ strategies exhibit varying capabilities in capturing predictive uncertainty and maintaining robustness. This work yields critical insights into the performance and reliability of various UQ methods when applied to immunogenicity data, helping to identify which methods offer the most trustworthy predictions. ImmUQBench provides a unified platform for assessing UQ approaches in immunogenicity prediction, facilitating the development of more trustworthy AI/ML models for therapeutic antigen design. By offering insights into the strengths and limitations of existing UQ methods, our work facilitates more effective and reliable immunogenic therapeutic discovery.

## Full-text entities

- **Genes:** TRAP [NCBI Gene 100187907], CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, HLA-A (major histocompatibility complex, class I, A) [NCBI Gene 3105] {aka HLAA}, TRBV20OR9-2 (T cell receptor beta variable 20/OR9-2 (non-functional)) [NCBI Gene 6962] {aka CDR3, TCRBV20S2, TCRBV2O, TCRBV2S2O}, FXYD1 (FXYD domain containing ion transport regulator 1) [NCBI Gene 5348] {aka PLM}
- **Diseases:** NLL (MESH:D064726), hallucinations (MESH:D006212), OOD (MESH:D020243), MCD (MESH:D012514), Tumor (MESH:D009369), autoimmune disorders (MESH:D001327), PLMs (MESH:D007806), infectious disease (MESH:D003141)
- **Chemicals:** OOD (-), amino acid (MESH:D000596)
- **Species:** Bacteria Latreille et al. 1825 (Bacteria stick insect, genus) [taxon 629395], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12996882/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12996882/full.md

## References

64 references — full list in the complete paper: https://tomesphere.com/paper/PMC12996882/full.md

---
Source: https://tomesphere.com/paper/PMC12996882