Supervised topological data analysis for MALDI mass spectrometry imaging applications
Gideon Klaila, Vladimir Vutov, Anastasios Stefanou

TL;DR
This paper introduces a novel algebraic topological framework for analyzing MALDI MSI data, improving tumor subtype classification by denoising and compressing spectral information, with demonstrated effectiveness on lung cancer datasets.
Contribution
The paper presents a new topological data analysis method that enhances MALDI data interpretation, noise reduction, and classification efficiency for cancer subtype identification.
Findings
Effective noise filtering using topological persistence.
Data compression reduces storage and computation time.
Competitive classification accuracy on real-world datasets.
Abstract
Background: Matrix-assisted laser desorption/ionization mass spectrometry imaging (MALDI MSI) displays significant potential for applications in cancer research, especially in tumor typing and subtyping. Lung cancer is the primary cause of tumor-related deaths, where the most lethal entities are adenocarcinoma (ADC) and squamous cell carcinoma (SqCC). Distinguishing between these two common subtypes is crucial for therapy decisions and successful patient management. Results: We propose a new algebraic topological framework, which obtains intrinsic information from MALDI data and transforms it to reflect topological persistence. Our framework offers two main advantages. Firstly, topological persistence aids in distinguishing the signal from noise. Secondly, it compresses the MALDI data, saving storage space and optimizes computational time for subsequent classification tasks. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetabolomics and Mass Spectrometry Studies · Geochemistry and Geologic Mapping · Advanced Proteomics Techniques and Applications
MethodsLogistic Regression
