Parallel DNA Sequence Alignment on High-Performance Systems with CUDA and MPI
Linus Zwaka

TL;DR
This paper presents a hybrid CUDA and MPI implementation of the Needleman-Wunsch algorithm, significantly accelerating DNA sequence alignment on high-performance systems by leveraging GPU and distributed computing.
Contribution
It introduces a novel hybrid parallelization approach combining CUDA and MPI for efficient large-scale sequence alignment without altering the core algorithm.
Findings
Achieves significant speedup over CPU-based methods
Effective handling of large datasets and multiple alignments
Demonstrates scalability on supercomputing systems
Abstract
Sequence alignment is a cornerstone of bioinformatics, widely used to identify similarities between DNA, RNA, and protein sequences and studying evolutionary relationships and functional properties. The Needleman-Wunsch algorithm remains a robust and accurate method for global sequence alignment. However, its computational complexity, O(mn), poses significant challenges when processing large-scale datasets or performing multiple sequence alignments. To address these limitations, a hybrid implementation of the Needleman-Wunsch algorithm that leverages CUDA for parallel execution on GPUs and MPI for distributed computation across multiple nodes on a supercomputer is proposed. CUDA efficiently offloads computationally intensive tasks to GPU cores, while MPI enables communication and workload distribution across nodes to handle large-scale alignments. This work details the implementation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Genomics and Phylogenetic Studies · Machine Learning in Bioinformatics
