De Novo Molecular Generation from Mass Spectra via Many-Body Enhanced Diffusion
Xichen Sun, Wentao Wei, Jiahua Rao, Jiancong Xie, Yuedong Yang

TL;DR
This paper introduces MBGen, a novel many-body enhanced diffusion framework that leverages higher-order interactions in mass spectrometry data to improve de novo molecular structure generation and isomer differentiation.
Contribution
The paper presents a new many-body attention mechanism integrated into a diffusion model, capturing complex interactions in MS/MS spectra for better molecular generation.
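To make the contrast between pairwise and higher-order attention concrete, here is a minimal NumPy sketch. It uses a generic trilinear (three-body) scoring form chosen purely for illustration and is not the attention mechanism defined in the MBGen paper. The function names, the tensor shapes, and the use of an elementwise product to combine the two value vectors are all assumptions.

```python
import numpy as np


def pairwise_attention(q, k, v):
    """Standard two-body attention: each query i scores single keys j."""
    d = q.shape[1]
    scores = q @ k.T / np.sqrt(d)                      # shape (n, m)
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    w = np.exp(scores)
    w = w / w.sum(axis=1, keepdims=True)               # softmax over j
    return w @ v                                       # shape (n, d)


def three_body_attention(q, k1, k2, v1, v2):
    """Illustrative three-body attention (hypothetical formulation).

    Each query i scores *pairs* of elements (j, k) with a trilinear form
    sum_d q[i, d] * k1[j, d] * k2[k, d], then aggregates a joint value
    for each pair (here, the elementwise product v1[j] * v2[k]).
    """
    n, d = q.shape
    m = k1.shape[0]
    scores = np.einsum("id,jd,kd->ijk", q, k1, k2) / np.sqrt(d)  # (n, m, m)
    flat = scores.reshape(n, m * m)
    flat = flat - flat.max(axis=1, keepdims=True)      # numerical stability
    w = np.exp(flat)
    w = (w / w.sum(axis=1, keepdims=True)).reshape(n, m, m)  # softmax over (j, k)
    pair_values = np.einsum("jd,kd->jkd", v1, v2)      # (m, m, d)
    return np.einsum("ijk,jkd->id", w, pair_values)    # (n, d)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    q = rng.normal(size=(4, 8))   # 4 query tokens, e.g. atoms (assumed)
    k = rng.normal(size=(5, 8))   # 5 key tokens, e.g. spectrum peaks (assumed)
    v = rng.normal(size=(5, 8))
    print(pairwise_attention(q, k, v).shape)
    print(three_body_attention(q, k, k, v, v).shape)
```

In this toy form the three-body score tensor grows as n × m², which is why practical higher-order attention schemes usually restrict or factorize the set of interacting tuples.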
Findings
MBGen outperforms state-of-the-art methods by up to 230%.
It effectively captures higher-order interactions in mass spectra.
The approach improves isomer differentiation and structural accuracy.
Abstract
Molecular structure generation from mass spectrometry is fundamental for understanding cellular metabolism and discovering novel compounds. Although tandem mass spectrometry (MS/MS) enables the high-throughput acquisition of fragment fingerprints, these spectra often reflect higher-order interactions involving the concerted cleavage of multiple atoms and bonds, which are crucial for resolving complex isomers and non-local fragmentation mechanisms. However, most existing methods adopt atom-centric and pairwise interaction modeling, overlooking higher-order edge interactions and lacking the capacity to systematically capture essential many-body characteristics for structure generation. To overcome these limitations, we present MBGen, a Many-Body enhanced diffusion framework for de novo molecular structure Generation from mass spectra. By integrating a many-body attention mechanism and higher-order…
Taxonomy
Topics: Mass Spectrometry Techniques and Applications · Protein Structure and Dynamics · Metabolomics and Mass Spectrometry Studies
