# Cross ionization mode chemical similarity prediction between tandem mass spectra in metabolomics

**Authors:** Niek F. de Jonge, Elena Chekmeneva, Robin Schmid, David Joas, Lem-Joe Truong, Justin J. J. van der Hooft, Florian Huber

PMC · DOI: 10.1038/s41467-026-69083-y · Nature Communications · 2026-02-07

## TL;DR

This paper introduces a machine learning model that improves the comparison of metabolite data across different ionization modes in mass spectrometry.

## Contribution

The novel contribution is a machine learning model, MS2DeepScore 2.0, that enables cross-ionization mode chemical similarity prediction.

## Key findings

- MS2DeepScore 2.0 successfully enables cross-ionization mode molecular networking.
- The model includes a quality estimation method to flag unreliable predictions.
- The tool enhances metabolite annotation and data exploration in untargeted metabolomics.

## Abstract

Mass spectrometry is a cornerstone of untargeted metabolomics, enabling the characterization of metabolites in both positive and negative ionization modes. However, comparisons across ionization modes have remained a substantial challenge due to the distinct fragmentation patterns produced by each polarity. To overcome this barrier, we present MS2DeepScore 2.0, a machine learning-based model to predict chemical similarity between mass fragmentation spectra, which works both between different and the same ionization modes. We demonstrate the utility of MS2DeepScore 2.0 in three case studies, where MS2DeepScore enabled cross-ionization mode molecular networking, enhancing data exploration and metabolite annotation. To ensure robustness, we have implemented a quality estimation method that flags spectra with low information content or those dissimilar to the training data, thereby minimizing false predictions. Altogether, MS2DeepScore 2.0 extends our current capabilities in organizing, exploring, and annotating untargeted metabolomics profiles.

Mass spectrometry is a cornerstone of untargeted metabolomics, but comparisons across ionization modes have remained a substantial challenge due to the distinct fragmentation patterns produced by each polarity. Here, the authors present MS2DeepScore 2.0, a machine learning-based model to predict chemical similarity between mass fragmentation spectra, which works both between different and the same ionization modes.

## Full-text entities

- **Chemicals:** Rutin (MESH:D012431), caffeine (MESH:D002110), Na (MESH:D012964), methanol (MESH:D000432), nucleotides (MESH:D009711), nucleosides (MESH:D009705), water (MESH:D014867), SRM1950 (-), oxygen (MESH:D010100), Lipid (MESH:D008055), methyl tert-butyl ether (MESH:C043243), K (MESH:D011188)
- **Species:** Rumex sanguineus (species) [taxon 291096], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12992594/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12992594/full.md

## References

9 references — full list in the complete paper: https://tomesphere.com/paper/PMC12992594/full.md

---
Source: https://tomesphere.com/paper/PMC12992594