# Semi-parametric validation of genomic predictions and polygenic risk scores with the Blupf90 software suite

**Authors:** Matias Bermann, Alejandra Álvarez Múnera, Ignacy Misztal, Daniela Lourenco

PMC · DOI: 10.1093/g3journal/jkaf136 · G3: Genes | Genomes | Genetics · 2025-06-12

## TL;DR

Researchers developed a new tool called validationf90 to better validate genomic predictions and risk scores by accounting for population structure and focusing on specific individuals.

## Contribution

The novel contribution is the development of validationf90, a semi-parametric validation tool integrated into the Blupf90 software suite.

## Key findings

- validationf90 accounts for population structure and allows validation of specific individuals.
- The tool calculates validation statistics and confidence intervals without repeated cross-validation.
- It supports genomic predictions from frequentist, Bayesian methods and categorical data.

## Abstract

Validation of genomic predictions or polygenic risk scores is key for model selection and evaluating the performance of the chosen prediction machinery. Non-parametric validation, such as cross-validation, is popular but does not account for population structure and the fact that the interest could be in validating a set of individuals and not the entire population. Semi-parametric methods, such as the LR method, also use removed records to validate predictions, account for population structure, and allow focus on a specific set of individuals of interest. Confidence intervals are obtained using semi-parametric methods without the need for repeated cross-validation. We developed a tool within the Blupf90 software suite, called validationf90, that allows researchers to conduct semi-parametric validation from the solutions obtained from that software suite. validationf90 calculates different validation statistics and their confidence intervals for a pre-defined set of individuals of interest, reflecting the bias and accuracy of genomic predictions. The program allows for genomic predictions obtained from frequentist and Bayesian methods, as well as for categorical data. validationf90 can validate any model supported by the Blupf90 software suite and can be used with animal, plant, and human datasets. Predictions obtained with other software can be provided to validationf90 as long as the input format matches with the Blupf90 format.

Validation of genomic predictions or polygenic risk scores is crucial for model selection and validating the performance of the chosen model. Non-parametric cross-validation does not consider population structure, and that not all individuals are of the same interest for validation. Semi-parametric validation overcomes these issues. Bermann et al. developed a software called validationf90, which performs semi-parametric validation within the Blupf90 software suite. This type of validation, as well as the software tool, is applicable to animal, plant, and human datasets.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12341872/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12341872/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/PMC12341872/full.md

---
Source: https://tomesphere.com/paper/PMC12341872