MS2MetGAN: Latent-space adversarial training for metabolite-spectrum matching in MS/MS database search
Meng Tsai, Alexzander Dwyer, Estelle Nuckels, Yingfeng Wang

TL;DR
MS2MetGAN introduces a novel latent-space adversarial training framework using autoencoders and GANs to improve metabolite-spectrum matching accuracy in MS/MS database searches.
Contribution
The paper presents a new method that generates negative training samples via GANs in latent space, enhancing metabolite identification performance.
Findings
MS2MetGAN outperforms existing methods in accuracy.
Latent-space matching improves identification robustness.
GAN-generated decoys enhance training quality.
Abstract
Database search is a widely used approach for identifying metabolites from tandem mass spectra (MS/MS). In this strategy, an experimental spectrum is matched against a user-specified database of candidate metabolites, and candidates are ranked such that true metabolite-spectrum matches receive the highest scores. Machine-learning methods have been widely incorporated into database-search-based identification tools and have substantially improved performance. To further improve identification accuracy, we propose a new framework for generating negative training samples. The framework first uses autoencoders to learn latent representations of metabolite structures and MS/MS spectra, thereby recasting metabolite-spectrum matching as matching between latent vectors. It then uses a GAN to generate latent vectors of decoy metabolites and constructs decoy metabolite-spectrum matches as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetabolomics and Mass Spectrometry Studies · Advanced Proteomics Techniques and Applications · Mass Spectrometry Techniques and Applications
