bioGWAS: A Simple and Flexible Tool for Simulating GWAS Datasets
Anton I. Changalidis, Dmitry A. Alexeev, Yulia A. Nasykhova, Andrey S. Glotov, Yury A. Barbitoff

TL;DR
bioGWAS is a new tool that simulates GWAS datasets with known genetic effects, helping researchers test and develop bioinformatics tools.
Contribution
bioGWAS introduces a flexible pipeline for simulating GWAS data with predefined causal genes and traits, enabling accurate benchmarking and testing.
Findings
bioGWAS can generate GWAS results with predefined causal genes and biological processes.
The tool successfully recapitulates published GWAS datasets using known genome-wide associations.
bioGWAS aids in benchmarking gene set enrichment analysis tools for GWAS data.
Abstract
Genome-wide association studies (GWAS) are a powerful tool for the identification of genes affecting human traits. Still, the interpretation of GWAS results is complicated, and new tools are actively being developed. Due to the scarcity of available datasets, simulation of GWAS data with known genetic effects is important as it enables accurate evaluation of such tools. In this study, we developed a flexible tool, bioGWAS, that provides a set of important functionalities for simulating GWAS results. We demonstrate that bioGWAS can efficiently generate GWAS results with predefined causal genes and biological processes and is capable of recapitulating the results of published GWAS studies. We thus believe that bioGWAS is an excellent method for testing bioinformatics software for GWAS results processing, as well as for the generation of datasets for educational purposes.
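To make "simulation of GWAS data with known genetic effects" concrete, here is a minimal Python sketch of the general idea. It is not the bioGWAS pipeline and does not use its interface. Every name and parameter in it (`simulate_genotypes`, `simulate_phenotype`, `association_scan`, the sample size, the allele frequency, the effect sizes) is a hypothetical choice made for illustration. It also assumes independent SNPs and a quantitative trait, which is far simpler than real genotype data.

```python
import math
import random


def simulate_genotypes(n_samples, n_snps, maf, rng):
    """Draw independent 0/1/2 genotypes (toy assumption: no linkage disequilibrium)."""
    return [
        [(rng.random() < maf) + (rng.random() < maf) for _ in range(n_snps)]
        for _ in range(n_samples)
    ]


def simulate_phenotype(genotypes, effects, noise_sd, rng):
    """Quantitative trait = sum of causal SNP effects + Gaussian noise."""
    return [
        sum(beta * row[j] for j, beta in effects.items()) + rng.gauss(0.0, noise_sd)
        for row in genotypes
    ]


def association_scan(genotypes, phenotype):
    """Per-SNP test: Pearson correlation -> t-like statistic -> normal-approximation p-value."""
    n = len(phenotype)
    mean_y = sum(phenotype) / n
    ss_y = sum((y - mean_y) ** 2 for y in phenotype)
    results = []
    for j in range(len(genotypes[0])):
        g = [row[j] for row in genotypes]
        mean_g = sum(g) / n
        ss_g = sum((x - mean_g) ** 2 for x in g)
        if ss_g == 0 or ss_y == 0:
            # Monomorphic SNP or constant trait: no association can be tested.
            results.append((j, 0.0, 1.0))
            continue
        cov = sum((x - mean_g) * (y - mean_y) for x, y in zip(g, phenotype))
        r = cov / math.sqrt(ss_g * ss_y)
        t = r * math.sqrt((n - 2) / max(1.0 - r * r, 1e-12))
        p = math.erfc(abs(t) / math.sqrt(2))  # two-sided, normal approximation
        results.append((j, t, p))
    return results


rng = random.Random(0)
causal_effects = {10: 1.0, 120: -1.0}  # hypothetical causal SNP index -> effect size
genotypes = simulate_genotypes(n_samples=500, n_snps=200, maf=0.3, rng=rng)
phenotype = simulate_phenotype(genotypes, causal_effects, noise_sd=1.0, rng=rng)
scan = association_scan(genotypes, phenotype)

# Because the causal SNPs are known, the scan can be checked against the truth.
top_hits = sorted(scan, key=lambda rec: rec[2])[: len(causal_effects)]
for snp, t_stat, p_value in top_hits:
    print(f"SNP {snp}: t = {t_stat:.2f}, p = {p_value:.2e}")
```

Because the causal SNPs are fixed in advance, any downstream method applied to `scan` can be scored against a known ground truth, which is the kind of benchmarking the abstract describes.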
Taxonomy
Topics: Genetic Associations and Epidemiology · Genetic Mapping and Diversity in Plants and Animals · Genetic and phenotypic traits in livestock
