Hardware Architecture for Inplace Compute of Burrows-Wheeler Transform in a Single Iteration
Kleber Stangherlin, Andrew Kennings

TL;DR
This paper introduces a hardware architecture for in-place computation of the Burrows-Wheeler transform that is fixed in performance, resource-efficient, and suitable for FPGA and ASIC implementations, enabling high throughput for data compression.
Contribution
The paper presents a novel hardware design for in-place BWT computation that does not require explicit suffix or output arrays, improving efficiency and performance.
Findings
FPGA implementation achieves up to 66 MB/s for 128 B blocks.
ASIC implementation reaches 161 MB/s for 128 B blocks.
Design does not depend on input string content or use block RAM.
Abstract
The Burrows-Wheeler transform (BWT) is used by the bzip2 family of compressors. In this paper, we present a hardware architecture that implements an inplace algorithm to compute the BWT. Our design does not have explicit storage for the suffix array, or output array. The performance of our implementation is fixed, and does not depend on the input string content. We use a register based character buffer in a scanchain configuration, such that the BWT is computed from right to left, as characters are loaded. Loading new characters is done every six cycles, producing a new output character from the previously computed block at the same rate. Our FGPA implementation does not use block ram instances, and achieves throughput of 66, 35, 18, and 15 MB/s for block sizes of 128 B, 1 kB, 4 kB, and 8 kB. We also report results for an ASIC implementation in 65 nm CMOS that achieves 161 MB/s when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Algorithms and Data Compression · Advanced Data Compression Techniques
