A harmonized benchmarking framework for implementation-aware evaluation of 46 polygenic risk score tools across binary and continuous phenotypes
Muhammad Muneeb, David B. Ascher

TL;DR
This paper introduces a comprehensive benchmarking framework for evaluating 46 polygenic risk score tools across various phenotypes, considering performance, computational efficiency, and robustness, to facilitate fair comparison and reproducibility.
Contribution
The authors developed a standardized, implementation-aware benchmarking framework that evaluates multiple PRS tools across diverse phenotypes and model configurations, addressing previous comparison challenges.
Findings
Significant differences in tool rankings across phenotypes and folds.
No single PRS method is universally optimal.
Performance depends on statistical methodology, phenotype architecture, and implementation factors.
Abstract
Polygenic risk score (PRS) tools differ substantially in statistical assumptions, input requirements, and implementation complexity, making direct comparison difficult. We developed a harmonized, implementation-aware benchmarking framework to evaluate 46 PRS tools across seven binary UK Biobank phenotypes and one continuous trait under three model configurations: null, PRS-only, and PRS plus covariates. The framework integrates standardized preprocessing, tool-specific execution, hyperparameter exploration, and unified downstream evaluation using five-fold cross-validation on high-performance computing infrastructure. In addition to predictive performance, we assessed runtime, memory use, input dependencies, and failure modes. A Friedman test across 40 phenotype--fold combinations confirmed significant differences in tool rankings (, ), with no…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Genomics and Rare Diseases · Bioinformatics and Genomic Networks
