MegIS: High-Performance, Energy-Efficient, and Low-Cost Metagenomic Analysis with In-Storage Processing
Nika Mansouri Ghiasi, Mohammad Sadrosadati, Harun Mustafa, Arvid, Gollwitzer, Can Firtina, Julien Eudine, Haiyu Mao, Jo\"el Lindegger, Meryem, Banu Cavlak, Mohammed Alser, Jisung Park, Onur Mutlu

TL;DR
MegIS is a novel in-storage processing system that significantly reduces data movement and accelerates metagenomic analysis, outperforming existing software and hardware solutions in speed and accuracy.
Contribution
This paper introduces MegIS, the first in-storage processing system tailored for metagenomics, addressing hardware limitations and optimizing data flow for high performance and energy efficiency.
Findings
MegIS achieves up to 37.2× speedup over software tools.
MegIS outperforms hardware-accelerated metagenomic tools by up to 5.1×.
MegIS maintains high accuracy comparable to existing tools.
Abstract
Metagenomics has led to significant advances in many fields. Metagenomic analysis commonly involves the key tasks of determining the species present in a sample and their relative abundances. These tasks require searching large metagenomic databases. Metagenomic analysis suffers from significant data movement overhead due to moving large amounts of low-reuse data from the storage system. In-storage processing can be a fundamental solution for reducing this overhead. However, designing an in-storage processing system for metagenomics is challenging because existing approaches to metagenomic analysis cannot be directly implemented in storage effectively due to the hardware limitations of modern SSDs. We propose MegIS, the first in-storage processing system designed to significantly reduce the data movement overhead of the end-to-end metagenomic analysis pipeline. MegIS is enabled by our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification
