MFF-AE: Enhanced Quality Control for Proteomics Mass Spectrometry Data via Multi-Scale Feature Fusion
Guangkui Fan, Xinyu Ji, Hunyue Liao, Bo Meng, Duotao Pan, Jinze Huang, Yang Zhao

TL;DR
This paper introduces MFF-AE, a deep learning model that improves quality control in proteomics mass spectrometry data by detecting anomalous samples more accurately than existing methods.
Contribution
The novel MFF-AE model integrates multi-scale features using a deep learning autoencoder to enhance anomaly detection in proteomics data.
Findings
MFF-AE outperforms 15 baseline models in detecting anomalous samples on a benchmark dataset.
Excluding outliers identified by MFF-AE increases statistical significance and fold change in differential proteins in clinical datasets.
Abstract
Mass spectrometry (MS) is a core analytical tool in proteomics, and the quality of the generated data directly determines the effectiveness of downstream analyses and the reliability of final research conclusions. While MS is also widely used in other omics applications, this study focuses on label-free quantitative proteomics, where samples are represented as protein-abundance matrices derived from MaxQuant. However, MS data are typically characterized by high dimensionality and substantial noise, posing serious challenges for quality control (QC). Existing QC methods have limited feature extraction capabilities and struggled to capture the key information embedded in the data, resulting in poor performance in identifying anomalous samples. Here, we propose the Multi-Scale Feature Fusion-based Autoencoder (MFF-AE). This deep learning-based anomaly detection model achieves precise…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Proteomics Techniques and Applications · Machine Learning in Bioinformatics · Mass Spectrometry Techniques and Applications
