# STRIKER: a spectral metadata repairing tool for expanding the comprehensiveness of spectral libraries

**Authors:** Ahmed Karam, Asmaa Ramzy, Taghreed Khaled Abdelmoneim, Maha Mokhtar, Nada A. Youssef, Aya Osama, Nabila Sabar, Sameh Magdeldin

PMC · DOI: 10.1186/s13321-026-01150-4 · Journal of Cheminformatics · 2026-01-27

## TL;DR

STRIKER is a tool that improves the quality of spectral libraries by repairing adduct metadata, helping researchers identify metabolites more accurately.

## Contribution

STRIKER introduces a deep learning-based method for repairing adduct metadata in spectral libraries with a user-friendly interface.

## Key findings

- STRIKER achieved 95–99% correct adduct matching and 98% adduct correction accuracy.
- The tool reduces missing or unusable spectra, improving metabolite identification and machine learning data quality.
- STRIKER supports the creation of customized sub-libraries and integrates with the Human Metabolome Database.

## Abstract

The expansion of untargeted metabolomics has made publicly accessible spectral libraries indispensable for metabolite annotation and machine learning applications. Enhancing the quality and consistency of these libraries is crucial for improving the accuracy of metabolite identification and training machine learning models. However, public spectral libraries often suffer from variability in user submissions, unintentional errors, and a lack of standardization. Existing metadata cleaning and normalization tools typically exclude spectra with incorrect or unsupported metadata rather than attempting to correct them, resulting in the loss of valuable spectral data and associated metabolites details. This study introduces STRIKER (SpecTRal lIbrary maKER), a repair tool specifically designed to address adduct metadata deficiencies using a distance-based metric and a deep learning model. STRIKER leverages advanced similarity-based approaches to predict adducts in spectra lacking adduct metadata. It corrects adduct-related errors and standardizes adduct formatting using a deep learning model based on the multi-layer perceptron (MLP) algorithm. STRIKER achieved 95–99% correct adduct matching and 98% adduct correction accuracy. These corrections substantially reduce the number of missing or unusable spectra and metabolites, thereby enhancing the accuracy of metabolite identification and improving data quality for machine learning applications. The tool also facilitates a convenient construction of the Human Metabolome Database (HMDB) spectral library by integrating data files from the HMDB website. Furthermore, it enables users to extract customized sub libraries from larger libraries, supporting tailored analyses for specific research objectives with percised search space. STRIKER is an open-source, user-friendly Python graphical interface designed to be accessible to researchers with minimal bioinformatics expertise. Available at the following repository under an MIT license: https://striker-gui.sourceforge.io.

Scientific contribution

The software is designed to preserve the maximum number of valid spectra in open mass spectral libraries, thereby supporting more comprehensive metabolite annotation in untargeted metabolomics. Its graphical user interface further facilitates the engagement of researchers without programming expertise, enabling them to enhance the quality and usability of spectral libraries.

The online version contains supplementary material available at 10.1186/s13321-026-01150-4.

## Full-text entities

- **Diseases:** adduct metadata deficiencies (MESH:C562949)
- **Chemicals:** metabolites (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12918191/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12918191/full.md

## References

1 references — full list in the complete paper: https://tomesphere.com/paper/PMC12918191/full.md

---
Source: https://tomesphere.com/paper/PMC12918191