Rapid GPU-Based Pangenome Graph Layout
Jiajie Li, Jan-Niklas Schmelzle, Yixiao Du, Simon Heumos, Andrea, Guarracino, Giulia Guidi, Pjotr Prins, Erik Garrison, Zhiru Zhang

TL;DR
This paper presents a GPU-accelerated method for visualizing large pangenome graphs, achieving significant speedups while maintaining layout quality, thus enabling faster analysis of complex genomic data.
Contribution
It introduces a GPU-based implementation with three key optimizations for pangenome graph layout, significantly improving performance over CPU methods.
Findings
Achieves 57.3x speedup over CPU baseline.
Reduces layout computation from hours to minutes.
Maintains layout quality with new scalable metric.
Abstract
Computational Pangenomics is an emerging field that studies genetic variation using a graph structure encompassing multiple genomes. Visualizing pangenome graphs is vital for understanding genome diversity. Yet, handling large graphs can be challenging due to the high computational demands of the graph layout process. In this work, we conduct a thorough performance characterization of a state-of-the-art pangenome graph layout algorithm, revealing significant data-level parallelism, which makes GPUs a promising option for compute acceleration. However, irregular data access and the algorithm's memory-bound nature present significant hurdles. To overcome these challenges, we develop a solution implementing three key optimizations: a cache-friendly data layout, coalesced random states, and warp merging. Additionally, we propose a quantitative metric for scalable evaluation of pangenome…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms
