Adventures in Multi-Omics I: Combining heterogeneous data sets via   relationships matrices

Deniz Akdemir; Julio Isidro Sanchez

arXiv:1912.03358·stat.AP·January 13, 2020

Adventures in Multi-Omics I: Combining heterogeneous data sets via relationships matrices

Deniz Akdemir, Julio Isidro Sanchez

PDF

Open Access

TL;DR

This paper introduces a covariance-based method using an expectation-maximization algorithm to combine partial, heterogeneous genomic relationship matrices, enhancing genomic prediction and phenomics analysis.

Contribution

The paper presents a novel EM algorithm for integrating overlapping relationship matrices, offering an alternative to feature imputation in multi-omics data integration.

Findings

01

Accurate combination of genomic relationship matrices demonstrated.

02

Improved genomic prediction using combined data sets.

03

Method applicable to various heterogeneous genotype-phenotype data.

Abstract

In this article, we propose a covariance based method for combining partial data sets in the genotype to phenotype spectrum. In particular, an expectation-maximization algorithm that can be used to combine partially overlapping relationship/covariance matrices is introduced. Combining data this way, based on relationship matrices, can be contrasted with a feature imputation based approach. We used several public genomic data sets to explore the accuracy of combining genomic relationship matrices. We have also used the heterogeneous genotype/phenotype data sets in the https://triticeaetoolbox.org/ to illustrate how this new method can be used in genomic prediction, phenomics, and graphical modeling.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenetic Mapping and Diversity in Plants and Animals · Genetic and phenotypic traits in livestock · Gene expression and cancer classification