# MSIMG: A Density-Aware Multi-Channel Image Representation Method for Mass Spectrometry

**Authors:** Fengyi Zhang, Boyong Gao, Yinchu Wang, Lin Guo, Wei Zhang, Xingchuang Xiong

PMC · DOI: 10.3390/s25206363 · Sensors (Basel, Switzerland) · 2025-10-15

## TL;DR

MSIMG is a new method that improves the representation of mass spectrometry data for better deep learning performance in phenotype classification.

## Contribution

MSIMG introduces a density-aware, content-driven patch selection strategy inspired by computer vision for mass spectrometry data representation.

## Key findings

- MSIMG outperforms traditional peak list and grid-based methods in phenotype classification tasks.
- The density-peak-centric strategy enhances information fidelity and model performance in deep learning.
- Applying computer vision techniques to mass spectrometry data shows promise for clinical diagnostics.

## Abstract

Extracting key features for phenotype classification from high-dimensional and complex mass spectrometry (MS) data presents a significant challenge. Conventional data representation methods, such as traditional peak lists or grid-based imaging strategies, are often hampered by information loss and compromised signal integrity, thereby limiting the performance of downstream deep learning models. To address this issue, we propose a novel data representation framework named MSIMG. Inspired by object detection in computer vision, MSIMG introduces a data-driven, “density-peak-centric” patch selection strategy. This strategy employs density map estimation and non-maximum suppression algorithms to locate the centers of signal-dense regions, which serve as anchors for dynamic, content-aware patch extraction. This process transforms raw mass spectrometry data into a multi-channel image representation with higher information fidelity. Extensive experiments conducted on two public clinical mass spectrometry datasets demonstrate that MSIMG significantly outperforms both the traditional peak list method and the grid-based MetImage approach. This study confirms that the MSIMG framework, through its content-aware patch selection, provides a more information-dense and discriminative data representation paradigm for deep learning models. Our findings highlight the decisive impact of data representation on model performance and successfully demonstrate the immense potential of applying computer vision strategies to analytical chemistry data, paving the way for the development of more robust and precise clinical diagnostic models.

## Full-text entities

- **Diseases:** injury to (MESH:D014947), MS (MESH:C536030), CD (MESH:D003424)
- **Chemicals:** lipids (MESH:D008055)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12568316/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12568316/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/PMC12568316/full.md

---
Source: https://tomesphere.com/paper/PMC12568316