HiCOPS: High Performance Computing Framework for Tera-Scale Database Search of Mass Spectrometry based Omics Data
Muhammad Haseeb, Fahad Saeed

TL;DR
HiCOPS is a high-performance computing framework that significantly accelerates mass spectrometry-based proteomics data analysis, achieving over 100-fold speedup by optimizing parallel processing and communication strategies on distributed architectures.
Contribution
The paper introduces HiCOPS, a novel parallel computational method and implementation that drastically improves the scalability and speed of peptide identification in large-scale MS data analysis.
Findings
Achieves over 100-fold speedup compared to existing tools.
Completes peptide identification in minutes on 72 nodes versus weeks on single nodes.
Demonstrates superior performance in execution time, CPU utilization, and I/O efficiency.
Abstract
Database-search algorithms, that deduce peptides from Mass Spectrometry (MS) data, have tried to improve the computational efficiency to accomplish larger, and more complex systems biology studies. Existing serial, and high-performance computing (HPC) search engines, otherwise highly successful, are known to exhibit poor-scalability with increasing size of theoretical search-space needed for increased complexity of modern non-model, multi-species MS-based omics analysis. Consequently, the bottleneck for computational techniques is the communication costs of moving the data between hierarchy of memory, or processing units, and not the arithmetic operations. This post-Moore change in architecture, and demands of modern systems biology experiments have dampened the overall effectiveness of the existing HPC workflows. We present a novel efficient parallel computational method, and its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Proteomics Techniques and Applications · Mass Spectrometry Techniques and Applications · Metabolomics and Mass Spectrometry Studies
