TL;DR
LIPNovo introduces a novel approach for de novo peptide sequencing that performs latent space imputation to address missing fragmentation data, significantly improving prediction accuracy over existing methods.
Contribution
The paper proposes LIPNovo, a new paradigm that imputes missing spectral information in the latent space before peptide prediction, enhancing performance in de novo sequencing.
Findings
LIPNovo outperforms state-of-the-art methods on benchmark datasets.
Latent imputation effectively compensates for missing spectral data.
The approach improves peptide sequencing accuracy significantly.
Abstract
De novo peptide sequencing is a fundamental computational technique for ascertaining amino acid sequences of peptides directly from tandem mass spectrometry data, eliminating the need for reference databases. Cutting-edge models usually encode the observed mass spectra into latent representations from which peptides are predicted autoregressively. However, the issue of missing fragmentation, attributable to factors such as suboptimal fragmentation efficiency and instrumental constraints, presents a formidable challenge in practical applications. To tackle this obstacle, we propose a novel computational paradigm called \underline{\textbf{L}}atent \underline{\textbf{I}}mputation before \underline{\textbf{P}}rediction (LIPNovo). LIPNovo is devised to compensate for missing fragmentation information within observed spectra before executing the final peptide prediction. Rather than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
