Neural Graph Matching Improves Retrieval Augmented Generation in Molecular Machine Learning
Runzhong Wang, Rui-Xi Wang, Mrunali Manjrekar, Connor W. Coley

TL;DR
This paper introduces MARASON, a neural graph matching approach that significantly improves retrieval-augmented molecular generation, achieving higher accuracy in mass spectrum simulation by explicitly modeling structural alignments.
Contribution
The paper presents a novel neural graph matching method integrated into molecular generation, enhancing retrieval-augmented models with explicit structural alignment capabilities.
Findings
MARASON achieves 28% top-1 accuracy in mass spectrum prediction.
Outperforms non-retrieval state-of-the-art and traditional graph matching methods.
Code is publicly available for reproducibility.
Abstract
Molecular machine learning has gained popularity with the advancements of geometric deep learning. In parallel, retrieval-augmented generation has become a principled approach commonly used with language models. However, the optimal integration of retrieval augmentation into molecular machine learning remains unclear. Graph neural networks stand to benefit from clever matching to understand the structural alignment of retrieved molecules to a query molecule. Neural graph matching offers a compelling solution by explicitly modeling node and edge affinities between two structural graphs while employing a noise-robust, end-to-end neural network to learn affinity metrics. We apply this approach to mass spectrum simulation and introduce MARASON, a novel model that incorporates neural graph matching to enhance a fragmentation-based neural network. Experimental results highlight the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning in Materials Science · Machine Learning in Bioinformatics · Advanced Graph Neural Networks
