# TCRLens: structure-aware equivariant graph learning for TCR-pMHC-I recognition and immunogenic epitope discovery

**Authors:** Paopit Siriarchawatana, Supawadee Ingsriswang, Challika Kaewborisuth, Anan Jongkaewwattana

PMC · DOI: 10.1093/bioadv/vbag066 · Bioinformatics Advances · 2026-02-24

## TL;DR

TCRLens is a deep learning framework that improves prediction of TCR recognition of pMHC-I complexes using structural data and generative models.

## Contribution

TCRLens introduces a structure-aware EGNN and a VAE-GAN for data augmentation, outperforming existing methods in TCR-pMHC-I recognition.

## Key findings

- TCRLens outperforms state-of-the-art methods in predicting TCR-pMHC-I interactions.
- The model shows robust cross-species generalization in swine and chicken MHC-I systems.
- Generative data augmentation improves performance in the face of data sparsity and class imbalance.

## Abstract

Accurate prediction of T-cell receptor (TCR) recognition of peptide-MHC class I (pMHC-I) complexes is a key challenge due to structural diversity and data sparsity. We introduce TCRLens, a structure-aware deep learning framework that models residue-level interactions across five critical interface zones using multi-scale graph representations and an equivariant graph neural network (EGNN). To mitigate data sparsity and severe class imbalance arising from limited negative samples, TCRLens incorporates a variational autoencoder-generative adversarial network (VAE-GAN) to generate structurally plausible weak-affinity interaction samples. We evaluated TCRlens across three prediction tasks including peptide-MHC binding, peptide-TCR recognition, and full-complex TCR-pMHC-I interaction and observed consistently strong performance.

Using curated dataset of human TCR-pMHC-I structural complexes from ATLAS and TCR3d, TCRLens outperforms state-of-the-art sequence-based, motif-based, and structure-aware methods, including NetMHCpan 4.2, CapsNet-MHC, RPEMHC, NetTCR-2.0, TITAN, PanPep, pMTnet, ERGO II, and STAG. Furthermore, TCRLens demonstrates robust cross-species generalization, achieving high predictive performance in swine and chicken MHC-I systems. These findings highlight the value of geometry-aware representation learning and generative data augmentation for capturing immunological specificity. TCRLens provides a unified and extensible platform for TCR-pMHC-I interaction modeling, with potential applications in epitope discovery and structure-guided vaccine design across both human and veterinary immunology.

The code used in this study is publicly available at https://github.com/paopitsiri/TCRLens.

## Linked entities

- **Proteins:** Tcr (Third chromosome alpha methyl dopa-resistant), HLA-C (major histocompatibility complex, class I, C)
- **Species:** Homo sapiens (taxon 9606)

## Full-text entities

- **Genes:** TRBV20OR9-2 (T cell receptor beta variable 20/OR9-2 (non-functional)) [NCBI Gene 6962] {aka CDR3, TCRBV20S2, TCRBV2O, TCRBV2S2O}, HLA-C (major histocompatibility complex, class I, C) [NCBI Gene 3107] {aka D6S204, HLA-JY3, HLAC, HLC-C, MHC, PSORS1}
- **Species:** Sus scrofa (pig, species) [taxon 9823], Homo sapiens (human, species) [taxon 9606], Gallus gallus (bantam, species) [taxon 9031]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13012892/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13012892/full.md

## References

45 references — full list in the complete paper: https://tomesphere.com/paper/PMC13012892/full.md

---
Source: https://tomesphere.com/paper/PMC13012892