# Degradation graphs reveal hidden proteolytic activity in peptidomes

**Authors:** Erik Hartman, Johan Malmström, Jonas Wallin, Yanbu Guo, Yanbu Guo, Yanbu Guo, Yanbu Guo

PMC · DOI: 10.1371/journal.pcbi.1013972 · PLOS Computational Biology · 2026-02-20

## TL;DR

This paper introduces degradation graphs, a new method to model protein breakdown as a network, revealing hidden proteolytic activity and improving peptidomics analysis.

## Contribution

Degradation graphs provide a novel probabilistic framework to model proteolysis as a network, enabling accurate quantification of proteolytic flows and enzyme activity.

## Key findings

- Failure to model downstream trimming leads to 3–4-fold underestimation of upstream proteolytic activity.
- Degradation graphs enable machine learning models to capture protease-specific signatures from graph topology and sequence context.
- Traditional analyses underestimate total proteolytic activity by up to fourfold when ignoring degradation dynamics.

## Abstract

Protein degradation is a regulated process that reshapes the proteome and generates bioactive peptides. Peptidomics and degradomics enables large-scale measurement of these peptides, yet most data analyses approaches treat peptides as isolated endpoints rather than intermediates produced by sequential cleavage. Here, we introduce degradation graphs, a probabilistic framework that represents proteolysis as a directed acyclic network of cleavage events with explicit absorption. From single-snapshot peptidomes, we infer graph weights by gradient descent or linear-flow optimization, quantify flows through branches and bottlenecks, and correct a core bias in conventional quantification. Across three biological datasets, failure to model downstream trimming leads to 3–4-fold underestimation of upstream proteolytic activity. Moreover, degradation graphs provide graph-structured features that enable machine learning models to capture protease-specific signatures from both graph topology and sequence context. Taken together, these findings establish explicit degradation modeling as a practical approach to mechanistic and interpretable peptidomics, bridging the fields of degradomics and peptidomics.

Proteins are continuously broken down into smaller fragments called peptides, a process known as proteolysis. This controlled degradation shapes the proteome, regulates signaling, and generates bioactive molecules involved in immunity, inflammation, and disease. Modern mass spectrometry techniques can measure thousands of such peptides, yet most analytical methods treat these peptides as isolated snapshots rather than as part of an ongoing proteolytic process.

In this study, we introduce a framework for modelling protein degradation as a networked process. We use degradation graphs to represent how proteins break down step by step, where each peptide is connected to all fragments it can generate. From experimental peptidomic data, these graphs can be reconstructed to quantify how proteolytic activity flows through the network, allowing more accurate estimation of enzyme activity and identification of bottlenecks in the degradation process.

Using both experimental and clinical datasets, we show that traditional analyses underestimate total proteolytic activity by up to fourfold. By treating the peptidome as a dynamic system rather than a static collection of fragments, degradation graphs bridge the gap between peptidomics and degradomics, offering a mechanistic and interpretable view of how proteins are proteolytically processed in health and disease.

## Full-text entities

- **Genes:** ABCB6 (ATP binding cassette subfamily B member 6 (LAN blood group)) [NCBI Gene 10058] {aka ABC, LAN, MTABC3, PRP, umat}, APOA1 (apolipoprotein A1) [NCBI Gene 335] {aka AMYLD3, HPALP2, apo(a)}, F2 (coagulation factor II, thrombin) [NCBI Gene 2147] {aka PT, RPRGL2, THPH1}, FGB (fibrinogen beta chain) [NCBI Gene 2244] {aka HEL-S-78p}, HBB (hemoglobin subunit beta) [NCBI Gene 3043] {aka CD113t-C, ECYT6, beta-globin}, UMOD (uromodulin) [NCBI Gene 7369] {aka ADMCKD2, ADTKD1, FJHN, HNFJ, HNFJ1, MCKD2}, POTEF (POTE ankyrin domain family member F) [NCBI Gene 728378] {aka A26C1B, POTE2alpha, POTEACTIN}
- **Diseases:** wound infection (MESH:D014946), infection (MESH:D007239), HBA (MESH:D013661), bacterial infections (MESH:D001424), diabetes (MESH:D003920), cancer (MESH:D009369), neurodegeneration (MESH:D019636), inflammation (MESH:D007249)
- **Chemicals:** BioRender (-)
- **Species:** Homo sapiens (human, species) [taxon 9606], Staphylococcus aureus (species) [taxon 1280], Pseudomonas aeruginosa (species) [taxon 287]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12923037/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12923037/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/PMC12923037/full.md

---
Source: https://tomesphere.com/paper/PMC12923037