# Multiparametric Optimization of Data-Dependent Acquisition Towards More Holistic Bacterial Metabolite Coverage Through Molecular Networking

**Authors:** Adivhaho Khwathisi, Amidou Samie, Asfatou Ndama Traore, Ntakadzeni Edwin Madala

PMC · DOI: 10.1155/ijm/4388417 · 2025-07-21

## TL;DR

This paper explores how optimizing mass spectrometry settings can improve detection of bacterial metabolites, leading to better coverage and analysis through molecular networking.

## Contribution

The study introduces a novel approach using design of experiments to optimize data acquisition settings for broader metabolite coverage in bacterial metabolomics.

## Key findings

- Adjusting collision energy significantly improves metabolite coverage compared to adjusting detection thresholds.
- Molecular networks from averaged spectral data outperform those from customized settings.
- Combined optimization of collision energy and intensity threshold yields greater metabolite coverage than single-point optimization.

## Abstract

Prokaryotic organisms rely on a limited array of metabolites for survival, which varies according to their natural environment. For example, soil-borne bacteria produce diverse metabolites, such as antibiotics, to thrive in their competitive surroundings, inhibiting the growth of nearby competing bacteria. The structural diversity of these compounds offers great analytical challenges, since there is no universal acquisition setting that can be applied to achieve their comprehensive coverage. Therefore, the use of a single experimental setup inevitably hinders the comprehensive metabolite coverage, which would affect the outputs. To address this, we propose employing a design of experiment (DoE) approach through the central composite design (CCD) to enhance the metabolite detection and broaden the coverage of the data-dependent acquisition (DDA) mode of the UHPLC-qTOF-MS technique. Our study reveals that altering collision energy significantly enhances metabolite coverage compared to adjusting the DDA threshold of detection. Furthermore, the ability of global natural product social (GNPS)–based molecular network models to annotate metabolites is greatly influenced by data acquisition settings, particularly affecting MS2 data. Interestingly, molecular networks constructed from averaged spectral data obtained through randomly selected DDA settings outperform those generated using customized settings through DoE modeling. This study demonstrates that in untargeted LC-MS metabolomics, both collision energy and intensity threshold independently enhance metabolite coverage in untargeted metabolomics. However, their combined use results in even greater coverage. Consequently, we recommend adopting group-based optimization over single-point optimization for more comprehensive metabolite coverage and in-depth exploration. However, caution should be taken in order to balance between robust data and redundancy.

## Full-text entities

- **Diseases:** CCD (MESH:D058617), RSM (MESH:D010534), MS (MESH:D009103), DDA (MESH:D019966)
- **Chemicals:** A, B, C, and D (-), Glu (MESH:D018698), NaI (MESH:D012974), H (MESH:D006859), nitrogen (MESH:D009584), Na (MESH:D012964), Phe (MESH:D010649), methanol (MESH:D000432), Asp (MESH:D001224), Leu (MESH:D007930), water (MESH:D014867), growth hormones (MESH:D013006), argon (MESH:D001128), dipeptide (MESH:D004151), formic acid (MESH:C030544), lipopeptides (MESH:D055666)
- **Species:** Bacillus subtilis (species) [taxon 1423], Homo sapiens (human, species) [taxon 9606]

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12303636/full.md

---
Source: https://tomesphere.com/paper/PMC12303636