# Genomic estimates of Identity-By-Descent relationships in large scale data sets

**Authors:** Theo Meuwisen, Xijiang Yu, Peer Berg

PMC · DOI: 10.1186/s12711-026-01038-9 · 2026-03-13

## TL;DR

This paper introduces a new method to estimate genetic relationships using genomic data, which is more accurate and efficient for managing inbreeding in large cattle populations.

## Contribution

The novel contribution is the development of FGla, a fast and accurate algorithm for estimating Identity-By-Descent relationships using dense marker genotypes in large pedigrees.

## Key findings

- FGla estimates of genomic relationships achieved high accuracy (up to 0.998) when applied to Norwegian Red Cattle data.
- Gla relationships were approximately unbiased in BLUP analysis and outperformed GRM and ROH-based methods.
- The algorithm is computationally efficient and suitable for genomic inbreeding management and optimal contribution selection.

## Abstract

Genomic relationship and inbreeding estimates are either based on genetic drift (e.g. the Genomic Relationship Matrix; GRM), homozygosity (e.g. Runs of Homozygosity; ROH), or Identity-By-Descent (IBD). A genomic IBD-based relationship matrix, Gla, is obtained by linkage analysis which uses genomic data to distinguish paternal versus maternal inheritances of chromosomal segments to replace the 50/50 probabilities used to calculate pedigree-based relationships (A matrix). Our aim was to develop a fast approximate algorithm, FGla, to estimate the Gla matrix in large complex pedigrees making use of dense marker genotypes, and to compare Gla to A, GRM and ROH based inbreeding (FROH) in simulated and a large scale Norwegian Red Cattle (NRF) data set.

Given pedigree data and ≥ 3 generations of 45 k marker genotypes, marker positions were detected that unambiguously identified maternal/paternal inheritance, and inheritances at intermediate positions were imputed by the Viterbi algorithm from the positions with known inheritance. Any remaining unknown inheritances were randomly sampled (paternal or maternal), and the sampling errors that this introduced were averaged out by the large number of marker loci used (correlation between replicated estimates: 0.9998). Also, calculations were limited to the relationship coefficients that were actually needed, assuming that relationships for a limited set of candidates were needed. The accuracy of estimated Gla coefficients increased from 0.971 to 0.998, when genotyping increased from the actually genotyped NRF cattle towards all pedigreed animals. The accuracy of the GRM was 0.936, but required only genotyping of the animals whose relationships were needed. Gla relationships were approximately unbiased in the Best Linear Unbiased Prediction (BLUP) sense. Hence, if Gla based inbreeding management predicts an increase in relationships then an identical increase in true IBD relationships is expected. Gla uses the same base population as A, namely that of the pedigree.

An approximate computationally efficient multipoint linkage analysis algorithm was developed to estimate unbiased IBD-based relationship and inbreeding coefficients. Its unbiasedness and precise definition of the base population makes it well suited for the genomic management of inbreeding and genomic optimal contribution selection. In addition, Gla based optimal contribution selection is neutral with respect to allele frequency changes.

The online version contains supplementary material available at 10.1186/s12711-026-01038-9.

## Full-text entities

- **Genes:** Gla [NCBI Gene 104968418]
- **Diseases:** IBD (MESH:D009105), OCS (MESH:D009155), NRC (MESH:C537312)
- **Chemicals:** Gla (MESH:D017965), A (MESH:D001151), IDFA (-)
- **Species:** Sus scrofa (pig, species) [taxon 9823], Ovis aries (domestic sheep, species) [taxon 9940], Bos taurus (bovine, species) [taxon 9913]

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC13003672/full.md

---
Source: https://tomesphere.com/paper/PMC13003672