# A shrinkage-based statistical method for testing group mean differences in quantitative bottom-up proteomics

**Authors:** Namgil Lee, Hojin Yoo, Juhyoung Kim, Heejung Yang

PMC · DOI: 10.1186/s12859-025-06275-1 · BMC Bioinformatics · 2025-10-31

## TL;DR

The paper introduces a new statistical method for analyzing proteomics data that improves accuracy in detecting changes in peptide quantities.

## Contribution

A novel probabilistic graphical model and statistical method that uses shrinkage estimation and bootstrap techniques for better performance in small sample sizes.

## Key findings

- The proposed method outperforms classical methods in specificity, sensitivity, and accuracy with simulated data resembling real MS data.
- The method effectively identifies peptides with mean quantity changes in real DIA-MS data after treatment with Staurosporine.
- The new approach is particularly effective under small sample size conditions.

## Abstract

In bottom-up proteomics using data-independent acquisition mass spectrometry (DIA-MS), quantitative measurements are obtained following multiple steps of protein fragmentation and ionization, which introduces cumulative errors and impairs the effectiveness of classical statistical methods. This study proposes an alternative statistical approach for testing group mean differences at the peptide level in quantitative bottom-up proteomics.

We present a novel probabilistic graphical model, that accounts for the non-normality of empirical distributions and the correlations between fragment ion quantities. Based on the model, we propose a new statistical method that improves upon the classical feature-based approach by incorporating distribution-free shrinkage estimation of covariance matrices and bootstrap-based estimation of degrees-of-freedom. Simulated experiments demonstrate that the proposed method outperforms the four most widely used classical methods in terms of specificity, sensitivity, and accuracy, particularly when the data distribution closely resembles real MS data, and under conditions of small sample sizes. Numerical analysis of real quantitative tandem mass spectrometry data reveals that the proposed method effectively identifies candidate peptides exhibiting changes in mean quantity following treatment with the kinase inhibitor Staurosporine.

The proposed statistical method offers an effective alternative to classical approaches for differential analysis of peptides in quantitative bottom-up proteomics using DIA-MS. The R software package MDstatsDIAMS is available at https://github.com/namgillee/MDstatsDIAMS.

## Linked entities

- **Chemicals:** Staurosporine (PubChem CID 5279)

## Full-text entities

- **Chemicals:** Staurosporine (MESH:D019311)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12577184/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12577184/full.md

## References

3 references — full list in the complete paper: https://tomesphere.com/paper/PMC12577184/full.md

---
Source: https://tomesphere.com/paper/PMC12577184