Scrooge: A Fast and Memory-Frugal Genomic Sequence Aligner for CPUs, GPUs, and ASICs
Jo\"el Lindegger, Damla Senol Cali, Mohammed Alser, Juan G\'omez-Luna,, Nika Mansouri Ghiasi, Onur Mutlu

TL;DR
Scrooge is a novel genomic sequence aligner that significantly improves speed and reduces memory usage across CPUs, GPUs, and ASICs by optimizing the GenASM algorithm with three key innovations.
Contribution
We introduce three algorithmic improvements to GenASM, making Scrooge faster and more memory-efficient for genomic sequence alignment on various hardware platforms.
Findings
CPU version of Scrooge is 20.1x faster than KSW2.
GPU version of Scrooge achieves up to 80.4x speedup over other implementations.
ASIC implementation of Scrooge uses 3.6x less chip area and 2.1x less power.
Abstract
Pairwise sequence alignment is a very time-consuming step in common bioinformatics pipelines. Speeding up this step requires heuristics, efficient implementations, and/or hardware acceleration. A promising candidate for all of the above is the recently proposed GenASM algorithm. We identify and address three inefficiencies in the GenASM algorithm: it has a high amount of data movement, a large memory footprint, and does some unnecessary work. We propose Scrooge, a fast and memory-frugal genomic sequence aligner. Scrooge includes three novel algorithmic improvements which reduce the data movement, memory footprint, and the number of operations in the GenASM algorithm. We provide efficient open-source implementations of the Scrooge algorithm for CPUs and GPUs, which demonstrate the significant benefits of our algorithmic improvements. For long reads, the CPU version of Scrooge achieves a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Algorithms and Data Compression · Machine Learning in Bioinformatics
