A Clustering Approach to Integrative Analysis of Multiomic Cancer Data

Dongyan Yan; Subharup Guha

arXiv:2202.04799·stat.ME·February 11, 2022

A Clustering Approach to Integrative Analysis of Multiomic Cancer Data

Dongyan Yan, Subharup Guha

PDF

Open Access

TL;DR

This paper introduces a Bayesian nonparametric clustering framework for integrating multi-omics cancer data, enabling identification of tumor subtypes and relevant genomic aberrations to improve cancer classification and understanding.

Contribution

It presents a novel scalable probabilistic model that simultaneously clusters samples and probes, incorporating dependencies across multiple omics domains for cancer data analysis.

Findings

01

Successfully applied to lung cancer data from TCGA

02

Accurately identified tumor subtypes and genomic drivers

03

Demonstrated scalability and flexibility of the method

Abstract

Rapid technological advances have allowed for molecular profiling across multiple omics domains from a single sample for clinical decision making in many diseases, especially cancer. As tumor development and progression are dynamic biological processes involving composite genomic aberrations, key challenges are to effectively assimilate information from these domains to identify genomic signatures and biological entities that are druggable, develop accurate risk prediction profiles for future patients, and identify novel patient subgroups for tailored therapy and monitoring. We propose integrative probabilistic frameworks for high-dimensional multiple-domain cancer data that coherently incorporate dependence within and between domains to accurately detect tumor subtypes, thus providing a catalogue of genomic aberrations associated with cancer taxonomy. We propose an innovative,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Bayesian Methods and Mixture Models