# SWAPS: A Modular Deep-Learning Empowered Peptide Identity Propagation Framework Beyond Match-Between-Run

**Authors:** Zixuan Xiao, Johanna Tüshaus, Bernhard Kuster, Matthew The, Mathias Wilhelm

PMC · DOI: 10.1021/acs.jproteome.4c00972 · 2025-03-07

## TL;DR

SWAPS is a new framework that improves peptide identification in proteomics by using MS1 data and deep learning, outperforming traditional methods in varied experimental conditions.

## Contribution

SWAPS introduces a modular, MS1-centric framework for peptide identity propagation that works across diverse LC gradients and improves identification rates significantly.

## Key findings

- SWAPS increases precursor identification by 46.3-112.1% over MS2-based methods in different LC gradients.
- SWAPS maintains quantitative accuracy while effectively deconvoluting MS1 signals.
- Current peptide property prediction models are not yet fully comparable to experimental data, highlighting a need for improvement.

## Abstract

Mass spectrometry (MS)-based proteomics relies heavily
on MS/MS
(MS2) data, which do not fully exploit the available MS1 information.
Traditional peptide identity propagation (PIP) methods, such as match-between-runs
(MBR), are limited to similar runs, particularly with the same liquid
chromatography (LC) gradients, thus potentially underutilizing available
proteomics libraries. We introduce SWAPS, a novel and modular MS1-centric
framework incorporating advances in peptide property prediction, extensive
proteomics libraries, and deep-learning-based postprocessing to enable
and explore PIP across more diverse experimental conditions and LC
gradients. SWAPS substantially enhances precursor identification,
especially in shorter gradients. On the example of 30, 15, and 7.5
min gradients, SWAPS achieves increases of 46.3, 86.2, and 112.1%
on precursor level over MaxQuant’s MS2-based identifications.
Despite the inherent challenges in controlling false discovery rates
(FDR) with MS1-based methods, SWAPS demonstrates strong efficacy in
deconvoluting MS1 signals, offering powerful discrimination and deeper
sequence exploration, while maintaining quantitative accuracy. By
building on and applying peptide property predictions in practical
contexts, SWAPS reveals that current models, while advanced, are still
not fully comparable to experimental measurements, sparking the need
for further research. Additionally, its modular design allows seamless
integration of future improvements, positioning SWAPS as a forward-looking
tool in proteomics.

## Full-text entities

- **Genes:** XIC (X chromosome inactivation center) [NCBI Gene 7502] {aka SXI1, XCE, XIST}, MS1 [NCBI Gene 4397], PIP (prolactin induced protein) [NCBI Gene 5304] {aka BRST-2, GCDFP-15, GCDFP15, GPIP4}
- **Chemicals:** H2O (MESH:D014867), DIA-NN (-), methionine (MESH:D008715), ACN (MESH:C084683), amino acid (MESH:D000596), peptide (MESH:D010455), FA (MESH:D005492), DMSO (MESH:D004121)
- **Species:** Saccharomyces cerevisiae (baker's yeast, species) [taxon 4932], Homo sapiens (human, species) [taxon 9606], Escherichia coli K-12 (strain) [taxon 83333], Escherichia coli (E. coli, species) [taxon 562]
- **Cell lines:** HeLa — Homo sapiens (Human), Human papillomavirus-related endocervical adenocarcinoma, Cancer cell line (CVCL_0030)

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11976850/full.md

---
Source: https://tomesphere.com/paper/PMC11976850