Peak Alignment of Gas Chromatography-Mass Spectrometry Data with Deep Learning
Mike Li, X. Rosalind Wang

TL;DR
ChromAlignNet is a deep learning model that effectively aligns GC-MS data peaks across samples, outperforming existing methods especially on complex datasets, and requires no user-defined parameters.
Contribution
The paper introduces ChromAlignNet, a deep learning approach for GC-MS peak alignment that surpasses traditional methods in accuracy and ease of use.
Findings
Achieves high AUC (~1) on simple datasets
Maintains good performance (~0.85 AUC) on complex datasets
Outperforms existing algorithms on complex data sets
Abstract
We present ChromAlignNet, a deep learning model for alignment of peaks in Gas Chromatography-Mass Spectrometry (GC-MS) data. In GC-MS data, a compound's retention time (RT) may not stay fixed across multiple chromatograms. To use GC-MS data for biomarker discovery requires alignment of identical analyte's RT from different samples. Current methods of alignment are all based on a set of formal, mathematical rules. We present a solution to GC-MS alignment using deep learning neural networks, which are more adept at complex, fuzzy data sets. We tested our model on several GC-MS data sets of various complexities and analysed the alignment results quantitatively. We show the model has very good performance (AUC for simple data sets and AUC for very complex data sets). Further, our model easily outperforms existing algorithms on complex data sets. Compared with existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetabolomics and Mass Spectrometry Studies · Computational Drug Discovery Methods · Analytical Chemistry and Chromatography
