# A parametric bootstrap approach for computing confidence intervals for genetic correlations with application to genetically determined protein-protein networks

**Authors:** Yi-Ting Tsai, Yana Hrytsenko, Michael Elgart, Usman A. Tahir, Zsu-Zsu Chen, James G. Wilson, Robert E. Gerszten, Tamar Sofer

PMC · DOI: 10.1016/j.xhgg.2024.100304 · Human Genetics and Genomics Advances · 2024-05-08

## TL;DR

The paper introduces a new method to compute confidence intervals for genetic correlations, especially useful for large-scale studies of protein-protein interactions.

## Contribution

A parametric bootstrap procedure is proposed to improve confidence interval estimation for genetic correlations in large omics datasets.

## Key findings

- The parametric bootstrap method outperforms asymptotic approaches in small samples or at parameter boundaries.
- The method is demonstrated on a proteomics dataset with tens of thousands of protein pairs.
- The procedure uses Haseman-Elston regression and kinship matrices to estimate genetic correlations.

## Abstract

Genetic correlation refers to the correlation between genetic determinants of a pair of traits. When using individual-level data, it is typically estimated based on a bivariate model specification where the correlation between the two variables is identifiable and can be estimated from a covariance model that incorporates the genetic relationship between individuals, e.g., using a pre-specified kinship matrix. Inference relying on asymptotic normality of the genetic correlation parameter estimates may be inaccurate when the sample size is low, when the genetic correlation is close to the boundary of the parameter space, and when the heritability of at least one of the traits is low. We address this problem by developing a parametric bootstrap procedure to construct confidence intervals for genetic correlation estimates. The procedure simulates paired traits under a range of heritability and genetic correlation parameters, and it uses the population structure encapsulated by the kinship matrix. Heritabilities and genetic correlations are estimated using the close-form, method of moment, Haseman-Elston regression estimators. The proposed parametric bootstrap procedure is especially useful when genetic correlations are computed on pairs of thousands of traits measured on the same exact set of individuals. We demonstrate the parametric bootstrap approach on a proteomics dataset from the Jackson Heart Study.

The authors developed a method to estimate CIs for genetic correlations. The method is particularly useful when applied on a large panel of omics assays. The authors demonstrate its application to study genetic correlations between tens of thousands of protein pairs.

## Full-text entities

- **Genes:** C5AR1 (complement C5a receptor 1) [NCBI Gene 728] {aka C5A, C5AR, C5R1, CD88}, MMP2 (matrix metallopeptidase 2) [NCBI Gene 4313] {aka CLG4, CLG4A, MMP-2, MMP-II, MONA, TBE-1}, JAG1 (jagged canonical Notch ligand 1) [NCBI Gene 182] {aka AGS, AGS1, AHD, AWS, CD339, CMT2HH}, NOTCH3 (notch receptor 3) [NCBI Gene 4854] {aka CADASIL, CADASIL1, CARASIL1, CASIL, FPLD1, IMF2}, NTRK3 (neurotrophic receptor tyrosine kinase 3) [NCBI Gene 4916] {aka GP145-TrkC, TRKC, gp145(trkC)}, APOD (apolipoprotein D) [NCBI Gene 347], CDK5R1 (cyclin dependent kinase 5 regulatory subunit 1) [NCBI Gene 8851] {aka CDK5P35, CDK5R, NCK5A, p23, p25, p35}, PAK3 (p21 (RAC1) activated kinase 3) [NCBI Gene 5063] {aka ARA, MRX30, MRX47, OPHN3, PAK-3, PAK3beta}, FGFR1 (fibroblast growth factor receptor 1) [NCBI Gene 2260] {aka BFGFR, CD331, CEK, ECCL, FGFBR, FGFR-1}, DKK4 (dickkopf Wnt signaling pathway inhibitor 4) [NCBI Gene 27121] {aka DKK-4}, AMH (anti-Mullerian hormone) [NCBI Gene 268] {aka MIF, MIS}, OMD (osteomodulin) [NCBI Gene 4958] {aka OSAD, SLRR2C}
- **Diseases:** JHS (MESH:D006331), overweight (MESH:D050177)
- **Chemicals:** GCTA (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11140211/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11140211/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC11140211/full.md

---
Source: https://tomesphere.com/paper/PMC11140211