FindeR: Accelerating FM-Index-based Exact Pattern Matching in Genomic Sequences through ReRAM technology
Farzaneh Zokaee, Mingzhe Zhang, Lei Jiang

TL;DR
FindeR leverages ReRAM technology to significantly accelerate exact pattern matching in genomic sequences, addressing the limitations of traditional hardware accelerators by improving throughput and energy efficiency in FM-Index searches.
Contribution
The paper introduces a novel ReRAM-based process-in-memory architecture, FindeR, which enhances FM-Index exact pattern matching throughput and energy efficiency for genomic analysis.
Findings
FindeR achieves up to 30,000x throughput improvement over existing accelerators.
FindeR improves throughput per Watt by up to 42,500x.
The system provides a library for easy integration into genome analysis workflows.
Abstract
Genomics is the critical key to enabling precision medicine, ensuring global food security and enforcing wildlife conservation. The massive genomic data produced by various genome sequencing technologies presents a significant challenge for genome analysis. Because of errors from sequencing machines and genetic variations, approximate pattern matching (APM) is a must for practical genome analysis. Recent work proposes FPGA, ASIC and even process-in-memory-based accelerators to boost the APM throughput by accelerating dynamic-programming-based algorithms (e.g., Smith-Waterman). However, existing accelerators lack the efficient hardware acceleration for the exact pattern matching (EPM) that is an even more critical and essential function widely used in almost every step of genome analysis including assembly, alignment, annotation and compression. State-of-the-art genome analysis adopts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Genomics and Phylogenetic Studies · Network Packet Processing and Optimization
