# PGSXplorer: an integrated nextflow pipeline for comprehensive quality control and polygenic score model development

**Authors:** Tutku Yaraş, Yavuz Oktay, Gökhan Karakülah

PMC · DOI: 10.7717/peerj.18973 · PeerJ · 2025-02-12

## TL;DR

PGSXplorer is a new pipeline that streamlines genomic data quality control and polygenic score model development using multiple tools and open-source software.

## Contribution

Introduces PGSXplorer, an integrated Nextflow pipeline combining multiple tools for standardized polygenic score development.

## Key findings

- PGSXplorer integrates tools like PLINK and PRSice-2 for quality control and PGS construction.
- The pipeline is scalable and effective for single- and multi-population analyses using Docker.
- PGSXplorer is open-source and available at GitHub for precision medicine research.

## Abstract

The rapid development of next-generation sequencing technologies and genomic data sharing initiatives during the post-Human Genome Project-era has catalyzed major advances in individualized medicine research. Genome-wide association studies (GWAS) have become a cornerstone of efforts towards understanding the genetic basis of complex diseases, leading to the development of polygenic scores (PGS). Despite their immense potential, the scarcity of standardized PGS development pipelines limits widespread adoption of PGS. Herein, we introduce PGSXplorer, a comprehensive Nextflow DSL2 pipeline that enables quality control of genomic data and automates the phasing, imputation, and construction of PGS models using reference GWAS data. PGSXplorer integrates various PGS development tools such as PLINK, PRSice-2, LD-Pred2, Lassosum2, MegaPRS, SBayesR-C, PRS-CSx and MUSSEL, improving the generalizability of PGS through multi-origin data integration. Tested with synthetic datasets, our fully Docker-encapsulated tool has demonstrated scalability and effectiveness for both single- and multi-population analyses. Continuously updated as an open-source tool, PGSXplorer is freely available with user tutorials at https://github.com/tutkuyaras/PGSXplorer, making it a valuable resource for advancing precision medicine in genetic research.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11829630/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11829630/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/PMC11829630/full.md

---
Source: https://tomesphere.com/paper/PMC11829630