# SVNeoPP: A Workflow for Structural-Variant-Derived Neoantigen Prediction and Prioritization Using Multi-Omics Data

**Authors:** Wanyang An, Xiaoxiu Tan, Zhenhao Liu, Li Zou, Manman Lu, Lu Xie

PMC · DOI: 10.3390/biology15060492 · Biology · 2026-03-19

## TL;DR

SVNeoPP is a new tool that identifies and prioritizes neoantigens from structural variants in tumors using multi-omics data, expanding the range of potential targets for cancer immunotherapy.

## Contribution

SVNeoPP introduces a reproducible workflow for predicting and prioritizing neoantigens derived from structural variants, integrating multi-dimensional evidence for improved accuracy.

## Key findings

- SVNeoPP successfully generated a high-confidence list of candidate neoantigens from hepatocellular carcinoma multi-omics data.
- Compared to other methods, SVNeoPP expanded the search space for SV-derived neoantigens and improved antigen-processing and HLA binding features.
- The tool integrates transcriptomic, proteomic, and immunogenicity data to prioritize neoantigens with interpretable evidence.

## Abstract

Structural variants (SVs) are widespread in tumors and can generate protein fragments with larger tumor–normal differences through genomic rearrangements or gene fusion events, making them an important source of neoantigens. In this study, we developed SVNeoPP (Structural Variant Neoantigen Prediction and Prioritization), a reproducible analytical workflow that converts SV-breakpoint information into peptide sequences, predicts candidate neoantigens, and prioritizes them through stepwise filtering supported by multi-dimensional evidence, including transcriptomic and proteomic data. In a proof-of-concept application to hepatocellular carcinoma (HCC) multi-omics datasets, SVNeoPP produced a high-confidence, high-priority shortlist of candidate neoantigens. SVNeoPP may be complemented as a useful bioinformatics tool for human genome structure and function analysis.

Background: Tumor neoantigens are key targets for personalized vaccines and T-cell therapies, yet most pipelines focus on neoantigens derived from SNV/small indel and often yield a limited number of high-quality candidates. SVs are prevalent in tumors and can generate novel chimeric sequences and neopeptides, making them a promising additional source of neoantigens. However, SV-derived neoantigen prediction remains challenging due to breakpoint uncertainty, isoform-dependent coding inference, and limited integration of multi-dimensional evidence and reproducibility. Methods: We developed SVNeoPP (Structural Variant Neoantigen Prediction and Prioritization), an end-to-end workflow for SV-derived neoantigen analysis. SVNeoPP takes WGS and RNA-seq as inputs, performs SV calling and annotation, and reconstructs altered transcripts and coding sequences in a traceable, isoform-aware manner to generate candidate peptides. Candidates are prescreened by integrating antigen-processing features with HLA binding prediction, and then hierarchically filtered and prioritized based on transcript expression, LC–MS/MS proteomics evidence, immunogenicity predictions, and sequence similarity to experimentally validated neoantigen databases. SVNeoPP is implemented in Snakemake to enable modular extension, checkpoint-based restarts, and end-to-end reproducibility. Results: Using a hepatocellular carcinoma (HCC) multi-omics dataset as a proof of concept, we demonstrated the performance of SVNeoPP and obtained a high-priority shortlist of candidate peptides. Compared with other methods, SVNeoPP substantially expanded the candidate search space for SV-derived neoantigens and showed more favorable distributions of antigen-processing and HLA binding features. Conclusions: SVNeoPP provides a reusable, traceable, and interpretable multi-dimensional evidence-driven framework for SV-derived neoantigens. As a complementary module to SNV/small-indel pipelines, it broadens the neoantigen candidate repertoire and generates ranked candidates with interpretable evidence to facilitate downstream prioritization and decision-making.

## Linked entities

- **Diseases:** hepatocellular carcinoma (MONDO:0007256)

## Full-text entities

- **Genes:** HLA-C (major histocompatibility complex, class I, C) [NCBI Gene 3107] {aka D6S204, HLA-JY3, HLAC, HLC-C, MHC, PSORS1}, HLA-B (major histocompatibility complex, class I, B) [NCBI Gene 3106] {aka AS, B-4901, HLAB}, TTN (titin) [NCBI Gene 7273] {aka CMD1G, CMH9, CMPD4, CMYO5, CMYP5, EOMFC}, HLA-A (major histocompatibility complex, class I, A) [NCBI Gene 3105] {aka HLAA}, POSTN (periostin) [NCBI Gene 10631] {aka OSF-2, OSF2, PDLPOSTN, PN}, COL1A1 (collagen type I alpha 1 chain) [NCBI Gene 1277] {aka CAFYD, EDSARTH1, EDSC, OI1, OI2, OI3}, CPS1 (carbamoyl-phosphate synthase 1) [NCBI Gene 1373] {aka CPS1D, CPSASE1, GATD6, PHN}
- **Diseases:** toxicity (MESH:D064420), HCC (MESH:D006528), SVNeoPP (MESH:D020914), injury to (MESH:D014947), VCF (MESH:D004062), Tumor (MESH:D009369)
- **Chemicals:** NeoSV (-), methionine (MESH:D008715), cysteine (MESH:D003545)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13024079/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13024079/full.md

## References

57 references — full list in the complete paper: https://tomesphere.com/paper/PMC13024079/full.md

---
Source: https://tomesphere.com/paper/PMC13024079