# PepTCR-Net: prediction of multi-class antigen peptides by T-cell receptor sequences with deep learning

**Authors:** Phi Le, Leah Ung, Hai Yang, Anwen Huang, Tao He, Peter Bruno, David Y Oh, Bridget P Keenan, Li Zhang

PMC · DOI: 10.1093/bib/bbaf351 · 2025-07-24

## TL;DR

This paper introduces PepTCR-Net, a deep learning framework that predicts which T-cell receptors recognize antigen peptides, helping in understanding immune responses and disease treatments.

## Contribution

The novel contribution is a two-step framework combining advanced feature engineering and a Bayesian neural network for TCR–antigen prediction.

## Key findings

- The framework shows strong predictive performance using engineered features from TCR and peptide sequences.
- PepTCR-Net successfully predicted TCR recognition of SARS-CoV-2 epitopes in a real-world application.

## Abstract

Predicting T-cell receptor (TCR) recognizing antigen peptides is crucial for understanding the immune system and developing new treatments for cancer, infectious and autoimmune diseases. As experimental methods for identifying TCR–antigen recognition are expensive and time-consuming, machine-learning approaches are increasingly used. However, existing computational tools often struggle with generalization due to limited data and challenges in acquiring true non-recognition pairs and rarely integrate multiple biological features into unified frameworks. To address these challenges, we propose a two-step framework for predicting TCR–antigen recognition. The first step focuses on feature engineering: neural network-based embeddings of letter-based TCR and peptide sequences inspired by language models, and categorical encoding of Human Leukocyte Antigen types and Variable/Joining genes. In the second step, we built a prediction model to assess the likelihood of TRC–antigen recognition by a Bayesian Feedforward Neural Network. We trained and validated the framework using large public databases. Our results demonstrate that our advanced feature engineering delivers strong predictive performance both internally and externally. We applied the framework to a real-world case for predicting whether specific TCRs can recognize SARS-CoV-2 epitope peptides, demonstrating that our framework can function as a de novo TCR–antigen prediction tool applicable to infectious diseases.

## Linked entities

- **Diseases:** cancer (MONDO:0004992), SARS-CoV-2 (MONDO:0100096)

## Full-text entities

- **Genes:** TRBV20OR9-2 (T cell receptor beta variable 20/OR9-2 (non-functional)) [NCBI Gene 6962] {aka CDR3, TCRBV20S2, TCRBV2O, TCRBV2S2O}
- **Diseases:** infectious (MESH:D003141), autoimmune diseases (MESH:D001327), cancer (MESH:D009369)
- **Species:** Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Homo sapiens (human, species) [taxon 9606]

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12286776/full.md

---
Source: https://tomesphere.com/paper/PMC12286776