TL;DR
TorchGWAS is a GPU-accelerated framework enabling rapid large-scale GWAS analysis across thousands of phenotypes, significantly outperforming CPU-based tools.
Contribution
It introduces a GPU-based implementation of GWAS that dramatically increases throughput for phenotype-rich genomic screening workflows.
Findings
TorchGWAS completes 20,480 phenotypes in 20 minutes on a single GPU.
It achieves a 1700-fold increase in phenotype throughput compared to CPU-based methods.
Supports large datasets with millions of markers and tens of thousands of samples.
Abstract
Motivation: Modern bioinformatics workflows, particularly in imaging and representation learning, can generate thousands to tens of thousands of quantitative phenotypes from a single cohort. In such settings, running genome-wide association analyses trait by trait rapidly becomes a computational bottleneck. While established GWAS tools are highly effective for individual traits, they are not optimized for phenotype-rich screening workflows in which the same genotype matrix is reused across a large phenotype panel. Results: We present TorchGWAS, a framework for high-throughput association testing of large phenotype panels through hardware acceleration. The current public release provides stable Python and command-line workflows for linear GWAS and multivariate phenotype screening, supports NumPy, PLINK, and BGEN genotype inputs, aligns phenotype and covariate tables by sample identifier,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
