# AI-driven computational methods and benchmarking for T-cell antigen identification

**Authors:** Yang Deng, Jinhao Que, Guangfu Xue, Yideng Cai, Wenyi Yang, Yilin Wang, Yi Hui, Zuxiang Wang, Yi Lin, Wenyang Zhou, Zhaochun Xu, Qinghua Jiang, Haoxiu Sun

PMC · DOI: 10.1093/bib/bbag123 · Briefings in Bioinformatics · 2026-03-17

## TL;DR

This paper reviews AI methods for identifying T-cell antigens and finds that current models struggle with predicting new epitope variants, highlighting the need for better computational approaches.

## Contribution

The paper provides a comprehensive survey and benchmarking of AI-driven methods for T-cell antigen identification, revealing significant generalization gaps in current models.

## Key findings

- Current TCR–pMHC prediction models show a significant generalization gap when tested on out-of-distribution epitope variants.
- Enhanced structural modeling and integration of multi-omics data are urgently needed to improve prediction accuracy.
- Generative models for de novo TCR design could help overcome current limitations in antigen prediction.

## Abstract

The rise of mRNA vaccines highlights the pivotal role of T-cell antigen identification in modern vaccinology and personalized medicine. T-cell recognition relies on the sophisticated ternary interaction between the T-cell receptor (TCR), the major histocompatibility complex (MHC) molecule, and the peptide antigen, which forms the peptide–MHC (pMHC) complex. Computational methods, particularly artificial intelligence (AI), are indispensable for accurately predicting these complex bindings. This review systematically surveys the rapidly evolving AI-driven landscape for T-cell antigen identification, providing a comprehensive categorization of methods for MHC-I, MHC-II, and the highly complex TCR–pMHC binding prediction, alongside foundational data resources. Crucially, we conduct a rigorous, standardized benchmarking of 18 state-of-the-art TCR–pMHC prediction models across diverse training data sources. Our evaluation on two distinct and challenging out-of-distribution (OOD) unseen epitope variant datasets reveals a significant and concerning generalization gap in current predictors. Notably, the overall absolute predictive gain remains marginal across all models under OOD conditions. This result underscores a severe and persistent generalization challenge when faced with novel epitope variants. To address these limitations, we emphasize the urgent need for enhanced structural modeling, the integration of multi-omics data, and the development of generative models for de novo TCR design. By advancing these computational frontiers, our community can accelerate the transition from prediction to rational design in immunoinformatics.

## Full-text entities

- **Genes:** CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, B2M (beta-2-microglobulin) [NCBI Gene 567] {aka AMYLD6, IMD43, MHC1D4}, HLA-A (major histocompatibility complex, class I, A) [NCBI Gene 3105] {aka HLAA}, HLA-B (major histocompatibility complex, class I, B) [NCBI Gene 3106] {aka AS, B-4901, HLAB}, HLA-C (major histocompatibility complex, class I, C) [NCBI Gene 3107] {aka D6S204, HLA-JY3, HLAC, HLC-C, MHC, PSORS1}, HLA-DPB1 (major histocompatibility complex, class II, DP beta 1) [NCBI Gene 3115] {aka DPB1, HLA-DP, HLA-DP1B, HLA-DPB}, TRBV20OR9-2 (T cell receptor beta variable 20/OR9-2 (non-functional)) [NCBI Gene 6962] {aka CDR3, TCRBV20S2, TCRBV2O, TCRBV2S2O}, APC (APC regulator of Wnt signaling pathway) [NCBI Gene 324] {aka BTPS2, DESMD, DP2, DP2.5, DP3, GS}
- **Diseases:** pancreatic cancer (MESH:D010190), colorectal cancer (MESH:D015179), OOD (MESH:D020243), melanoma (MESH:D008545), cancer (MESH:D009369), glioblastoma (MESH:D005909), MSD (MESH:D052517), infectious diseases (MESH:D003141), COVID-19 (MESH:D000086382)
- **Chemicals:** BK (-), amino acid (MESH:D000596)
- **Species:** Homo sapiens (human, species) [taxon 9606], Saccharomyces cerevisiae (baker's yeast, species) [taxon 4932]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12993716/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12993716/full.md

## References

161 references — full list in the complete paper: https://tomesphere.com/paper/PMC12993716/full.md

---
Source: https://tomesphere.com/paper/PMC12993716