A likelihood-based scoring method for peptide identification using mass spectrometry
Qunhua Li, Jimmy K. Eng, Matthew Stephens

TL;DR
This paper introduces a likelihood-based probabilistic scoring method for peptide identification in mass spectrometry, improving accuracy and uncertainty assessment over existing heuristic approaches by modeling spectral data more comprehensively.
Contribution
It presents a new probabilistic model for peptide-spectrum matching that incorporates peak details, providing a more principled and accurate scoring criterion than traditional heuristic methods.
Findings
Outperforms existing methods like SEQUEST on benchmark data.
Provides natural measures for uncertainty in peptide identification.
Enhances peptide scoring with detailed spectral modeling.
Abstract
Mass spectrometry provides a high-throughput approach to identify proteins in biological samples. A key step in the analysis of mass spectrometry data is to identify the peptide sequence that, most probably, gave rise to each observed spectrum. This is often tackled using a database search: each observed spectrum is compared against a large number of theoretical "expected" spectra predicted from candidate peptide sequences in a database, and the best match is identified using some heuristic scoring criterion. Here we provide a more principled, likelihood-based, scoring criterion for this problem. Specifically, we introduce a probabilistic model that allows one to assess, for each theoretical spectrum, the probability that it would produce the observed spectrum. This probabilistic model takes account of peak locations and intensities, in both observed and theoretical spectra, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
