Semi-supervised Cooperative Learning for Multiomics Data Fusion

Daisy Yi Ding; Xiaotao Shen; Michael Snyder; Robert Tibshirani

arXiv:2308.01458·q-bio.QM·August 4, 2023·ML4MHD

Semi-supervised Cooperative Learning for Multiomics Data Fusion

Daisy Yi Ding, Xiaotao Shen, Michael Snyder, Robert Tibshirani

PDF

Open Access

TL;DR

This paper introduces semi-supervised cooperative learning for multiomics data fusion, effectively leveraging unlabeled data to improve predictive accuracy in biological studies.

Contribution

It proposes a novel semi-supervised framework using an agreement penalty to incorporate unlabeled data into multiomics fusion, enhancing predictive performance.

Findings

01

Superior performance on simulated data

02

Effective in real aging multiomics study

03

Maximizes utility of labeled and unlabeled data

Abstract

Multiomics data fusion integrates diverse data modalities, ranging from transcriptomics to proteomics, to gain a comprehensive understanding of biological systems and enhance predictions on outcomes of interest related to disease phenotypes and treatment responses. Cooperative learning, a recently proposed method, unifies the commonly-used fusion approaches, including early and late fusion, and offers a systematic framework for leveraging the shared underlying relationships across omics to strengthen signals. However, the challenge of acquiring large-scale labeled data remains, and there are cases where multiomics data are available but in the absence of annotated labels. To harness the potential of unlabeled multiomcis data, we introduce semi-supervised cooperative learning. By utilizing an "agreement penalty", our method incorporates the additional unlabeled data in the learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Single-cell and spatial transcriptomics