Unlocking High-Fidelity Molecular Generation from Mass Spectra via Dual-Stream Line Graph Diffusion
Xujun Che, Xiuxia Du, Depeng Xu

TL;DR
This paper introduces DualLGD, a dual-stream graph diffusion model that improves molecular generation from mass spectra by separately reasoning about atoms and bonds, achieving state-of-the-art accuracy.
Contribution
The paper proposes a novel dual-stream architecture for molecular graph denoising, explicitly modeling atom and bond reasoning in separate but synchronized streams.
Findings
DualLGD achieves top-1 accuracy of 34.37% and 23.89% on benchmarks.
The architecture outperforms previous methods by approximately 3 times.
Ablation studies show the architecture itself is the main source of improvement.
Abstract
De novo molecular generation from tandem mass spectra is a challenging inverse problem whose core difficulty lies in the circular dependency between atom-level and bond-level reasoning: determining a bond's type requires knowing its endpoint atoms' chemical environment, yet an atom's environment is in turn defined by its incident bonds. Existing graph diffusion methods process atoms and bonds within a single computation stream, where atom-bond information synchronization can only occur implicitly across layers. We argue that this single-stream paradigm, rather than the choice of any particular aggregation kernel, is a key architectural bottleneck. We propose DualLGD (Dual-stream Line Graph Diffusion), which reformulates molecular graph denoising as the alternating solution of two coupled subproblems: atom-level reasoning and bond-level reasoning, each operating in its own dedicated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
