# ImmunoPepper: extracting personalized peptides from complex splicing graphs

**Authors:** Laurie Prélot, Jiayu Chen, Matthias Hüser, André Kahles, Gunnar Rätsch

PMC · DOI: 10.1093/bioinformatics/btaf492 · 2025-10-09

## TL;DR

ImmunoPepper is a new tool that identifies cancer-specific peptides from RNA splicing data, which could help in developing personalized cancer treatments.

## Contribution

ImmunoPepper introduces a novel method for extracting personalized peptides from splicing graphs, incorporating germline and somatic variation.

## Key findings

- ImmunoPepper identified an average of 834 and 569 cancer-specific MHC-I binding 9-mers per ovarian and breast cancer sample, respectively.
- MassSpec validation showed an average of 25 and 20 MHC-I binders per ovarian and breast cancer sample, respectively.

## Abstract

RNA sequencing enables the characterization of a cell’s transcript isoforms in healthy and disease conditions. In the context of cancer, local transcript variability may translate to splicing-derived tumor-associated peptides recognized by the immune system. A software tool that extracts such candidate peptides, is of great interest for personalized cancer therapy.

We present the open-source software tool ImmunoPepper, which extracts a set of biologically plausible peptides from a splicing graph, derived from a set of RNA-seq datasets. This peptide set can be personalized with germline and somatic variation and takes novel RNA splice variants into account. ImmunoPepper supports several filtering options, including subtraction of normal tissue background, prediction of MHC-binding affinity, as well as MassSpec-based validation of identified peptides. We analyzed 32 ovarian cancer (TCGA-OV) and 31 breast invasive carcinoma (TCGA-BRCA) samples, with a strict cancer-specific filtering configuration, and obtained on average 834 and 569 cancer-specific predicted MHC-I binding 9-mers per sample, for each cohort, respectively. MassSpec validation with the target-decoy competition Subset-Neighbor-Search (SNS) showed an average validation rate of 4.5% per TCGA-OV sample and 5.3% per TCGA-BRCA sample. This corresponded to 25 MHC-I binders 9-mers per TCGA-OV sample, and 20 MHC-I binders 9-mers per TCGA-BRCA sample in average. Finally, we draw conclusions about the best framework for generation of splicing-derived neoepitopes and recommend to use joint data structures when processing homogeneously a cancer and a normal cohort and to focus on reproducibility of the candidates across generation pipelines.

ImmunoPepper is implemented in Python 3 and is available as open-source software at https://github.com/ratschlab/immunopepper. The online documentation can be found at https://immunopepper.readthedocs.io/en/latest/.

## Linked entities

- **Diseases:** ovarian cancer (MONDO:0005140)

## Full-text entities

- **Genes:** BRCA1 (BRCA1 DNA repair associated) [NCBI Gene 672] {aka BRCAI, BRCC1, BROVCA1, FANCS, IRIS, PNCA4}, HLA-C (major histocompatibility complex, class I, C) [NCBI Gene 3107] {aka D6S204, HLA-JY3, HLAC, HLC-C, MHC, PSORS1}
- **Diseases:** ovarian cancer (MESH:D010051), breast invasive carcinoma (MESH:D001943), cancer (MESH:D009369)

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12777967/full.md

---
Source: https://tomesphere.com/paper/PMC12777967