GPU-Accelerated BWT Construction for Large Collection of Short Reads
Chi-Man Liu, Ruibang Luo, Tak-Wah Lam

TL;DR
This paper introduces CX1, a GPU-accelerated tool for constructing the Burrows-Wheeler transform of large short-read collections, significantly reducing processing time compared to previous methods.
Contribution
CX1 is the first tool to leverage GPU, multi-core CPU, and cluster parallelism for efficient BWT construction of massive read datasets.
Findings
Constructs BWT of 100 Gb reads in under 2 hours on a single machine.
Achieves near-linear speedup with a cluster of 4 machines.
Outperforms previous fastest tool, BRC, which takes 12 hours for the same data.
Abstract
Advances in DNA sequencing technology have stimulated the development of algorithms and tools for processing very large collections of short strings (reads). Short-read alignment and assembly are among the most well-studied problems. Many state-of-the-art aligners, at their core, have used the Burrows-Wheeler transform (BWT) as a main-memory index of a reference genome (typical example, NCBI human genome). Recently, BWT has also found its use in string-graph assembly, for indexing the reads (i.e., raw data from DNA sequencers). In a typical data set, the volume of reads is tens of times of the sequenced genome and can be up to 100 Gigabases. Note that a reference genome is relatively stable and computing the index is not a frequent task. For reads, the index has to computed from scratch for each given input. The ability of efficient BWT construction becomes a much bigger concern than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Data Storage Technologies · Advanced Image and Video Retrieval Techniques
