SAIGE-GPU: accelerating genome- and phenome-wide association studies using GPUs
Alex Rodriguez, Youngdae Kim, Tarak Nath Nandi, Karl Keat, Rachit Kumar, Mitchell Conery, Rohan Bhukar, Molei Liu, John Hessington, Ketan Maheshwari, Sumitra Muralidhar, Sumitra Muralidhar, Jennifer Moser, Jennifer E Deen, Philip S Tsao, Sumitra Muralidhar, J Michael Gaziano

TL;DR
This paper introduces SAIGE-GPU, a faster version of a popular GWAS tool that uses GPUs to speed up large-scale genetic studies.
Contribution
The novel use of GPU acceleration in SAIGE for mixed-model GWAS computations enables practical phenome-wide studies.
Findings
SAIGE-GPU achieved a 5-fold speedup in mixed model fitting for large biobank datasets.
The method supports efficient analysis of diverse and admixed populations using GPU-optimized kernels.
Multi-core and multi-trait parallelization further improved performance on cloud platforms.
Abstract
Genome-wide association studies (GWAS) at biobank scale are computationally intensive, especially for admixed populations requiring robust statistical models. SAIGE is a widely used method for generalized linear mixed-model GWAS but is limited by its CPU-based implementation, making phenome-wide association studies impractical for many research groups. We developed SAIGE-GPU, a GPU-accelerated version of SAIGE that replaces CPU-intensive matrix operations with GPU-optimized kernels. The core innovation is distributing genetic relationship matrix calculations across GPUs and communication layers. Applied to 2068 phenotypes from 635 969 participants in the Million Veteran Program, including diverse and admixed populations, SAIGE-GPU achieved a 5-fold speedup in mixed model fitting on supercomputing infrastructure and cloud platforms. We further optimized the variant association testing…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Bioinformatics and Genomic Networks · Gene expression and cancer classification
