Regressor-guided Diffusion Model for De Novo Peptide Sequencing with Explicit Mass Control
Shaorong Chen, Jingbo Zhou, Jun Xia

TL;DR
DiffuNovo introduces a diffusion-based model for de novo peptide sequencing that explicitly enforces mass constraints, significantly improving accuracy and physical plausibility over previous methods.
Contribution
It is the first to incorporate a diffusion model with explicit mass control in de novo peptide sequencing, enhancing prediction accuracy and physical plausibility.
Findings
Outperforms state-of-the-art DNPS methods in accuracy.
Reduces mass error significantly, producing more plausible peptides.
Employs a novel mass loss and guidance mechanism during training and inference.
Abstract
The discovery of novel proteins relies on sensitive protein identification, for which de novo peptide sequencing (DNPS) from mass spectra is a crucial approach. While deep learning has advanced DNPS, existing models inadequately enforce the fundamental mass consistency constraint, that a predicted peptide's mass must match the experimental measured precursor mass. Previous DNPS methods often treat this critical information as a simple input feature or use it in post-processing, leading to numerous implausible predictions that do not adhere to this fundamental physical property. To address this limitation, we introduce DiffuNovo, a novel regressor-guided diffusion model for de novo peptide sequencing that provides explicit peptide-level mass control. Our approach integrates the mass constraint at two critical stages: during training, a novel peptide-level mass loss guides model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Machine Learning in Bioinformatics · vaccines and immunoinformatics approaches
