BIBI: Bayesian Inference of Breed Composition
Carlos Alberto Mart\'inez, Kshitij Khare, Mauricio A. Elzo

TL;DR
This paper introduces two Bayesian models, BIBI and BIBI2, for estimating breed composition from genetic marker data, with BIBI2 accounting for uncertainty in allele frequencies, leading to more accurate estimates.
Contribution
The paper develops Bayesian generalized linear models that improve breed composition estimation by incorporating uncertainty in allele frequencies, outperforming existing methods.
Findings
BIBI2 improves accuracy by 8.3% over BIBI.
Both models outperform the OLSK estimator.
Models successfully applied to Angus-Brahman population.
Abstract
The aim of this paper was to develop statistical models to estimate individual breed composition based on the previously proposed idea of regressing discrete random variables corresponding to counts of reference alleles of biallelic molecular markers located across the genome on the allele frequencies of each marker in the pure (base) breeds. Some of the existing regression-based methods do not guarantee that estimators of breed composition will lie in the appropriate parameter space and none of them account for uncertainty about allele frequencies in the pure breeds, that is, uncertainty about the design matrix. In order to overcome these limitations, we proposed two Bayesian generalized linear models. For each individual, both models assume that the counts of the reference allele at each marker locus follow independent Binomial distributions, use the logit link, and pose a Dirichlet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic and phenotypic traits in livestock · Genetic Mapping and Diversity in Plants and Animals · Genetic diversity and population structure
