A Memory-Efficient FM-Index Constructor for Next-Generation Sequencing Applications on FPGAs
Nae-Chyun Chen, Yu-Cheng Li, Yi-Chang Lu

TL;DR
This paper introduces a novel, memory-efficient FPGA-based FM-index constructor tailored for next-generation sequencing, enabling fast, compact, and hardware-friendly sequence indexing for small-scale DNA reads.
Contribution
It proposes a hardware-compatible Self-Aided Incremental Indexing algorithm and architecture that significantly reduces memory overhead and enhances efficiency for FM-index construction on FPGAs.
Findings
Supports DNA sequences up to 131,072 bp
Requires minimal hardware resources
Achieves efficient FM-index construction on FPGA
Abstract
FM-index is an efficient data structure for string search and is widely used in next-generation sequencing (NGS) applications such as sequence alignment and de novo assembly. Recently, FM-indexing is even performed down to the read level, raising a demand of an efficient algorithm for FM-index construction. In this work, we propose a hardware-compatible Self-Aided Incremental Indexing (SAII) algorithm and its hardware architecture. This novel algorithm builds FM-index with no memory overhead, and the hardware system for realizing the algorithm can be very compact. Parallel architecture and a special prefetch controller is designed to enhance computational efficiency. An SAII-based FM-index constructor is implemented on an Altera Stratix V FPGA board. The presented constructor can support DNA sequences of sizes up to 131,072-bp, which is enough for small-scale references and reads…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Genomics and Phylogenetic Studies · DNA and Biological Computing
