A nested mixture model for protein identification using mass   spectrometry

Qunhua Li; Michael J. MacCoss; Matthew Stephens

arXiv:1011.2087·stat.AP·November 10, 2010

A nested mixture model for protein identification using mass spectrometry

Qunhua Li, Michael J. MacCoss, Matthew Stephens

PDF

TL;DR

This paper introduces a nested mixture model for mass spectrometry data that improves protein and peptide identification accuracy by integrating the identification process into a single statistical framework, outperforming existing methods.

Contribution

The paper proposes a novel one-stage nested mixture model that jointly infers proteins and peptides, enhancing accuracy over traditional two-stage approaches.

Findings

01

Single-stage approach yields more accurate peptide identification.

02

Comparable protein identification accuracy with existing methods, with some scenarios favoring the new model.

03

Model performs well on simulated and real yeast data.

Abstract

Mass spectrometry provides a high-throughput way to identify proteins in biological samples. In a typical experiment, proteins in a sample are first broken into their constituent peptides. The resulting mixture of peptides is then subjected to mass spectrometry, which generates thousands of spectra, each characteristic of its generating peptide. Here we consider the problem of inferring, from these spectra, which proteins and peptides are present in the sample. We develop a statistical approach to the problem, based on a nested mixture model. In contrast to commonly used two-stage approaches, this model provides a one-stage solution that simultaneously identifies which proteins are present, and which peptides are correctly identified. In this way our model incorporates the evidence feedback between proteins and their constituent peptides. Using simulated data and a yeast data set, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.