Semi-supervised Cooperative Learning for Multiomics Data Fusion
Daisy Yi Ding, Xiaotao Shen, Michael Snyder, Robert Tibshirani

TL;DR
This paper introduces semi-supervised cooperative learning for multiomics data fusion, effectively leveraging unlabeled data to improve predictive accuracy in biological studies.
Contribution
It proposes a novel semi-supervised framework using an agreement penalty to incorporate unlabeled data into multiomics fusion, enhancing predictive performance.
Findings
Superior performance on simulated data
Effective in real aging multiomics study
Maximizes utility of labeled and unlabeled data
Abstract
Multiomics data fusion integrates diverse data modalities, ranging from transcriptomics to proteomics, to gain a comprehensive understanding of biological systems and enhance predictions on outcomes of interest related to disease phenotypes and treatment responses. Cooperative learning, a recently proposed method, unifies the commonly-used fusion approaches, including early and late fusion, and offers a systematic framework for leveraging the shared underlying relationships across omics to strengthen signals. However, the challenge of acquiring large-scale labeled data remains, and there are cases where multiomics data are available but in the absence of annotated labels. To harness the potential of unlabeled multiomcis data, we introduce semi-supervised cooperative learning. By utilizing an "agreement penalty", our method incorporates the additional unlabeled data in the learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Single-cell and spatial transcriptomics
