# dCCA: detecting differential covariation patterns between two types of high-throughput omics data

**Authors:** Hwiyoung Lee, Tianzhou Ma, Hongjie Ke, Zhenyao Ye, Shuo Chen

PMC · DOI: 10.1093/bib/bbae288 · 2024-06-18

## TL;DR

This paper introduces dCCA, a new method to detect how relationships between two types of omics data differ across clinical groups, helping uncover disease-related biological patterns.

## Contribution

The novel dCCA method captures differential multivariate covariation patterns between two omics data types across clinical groups.

## Key findings

- dCCA outperforms existing methods in variable selection and recovering differential correlations in simulations.
- dCCA identified differentially expressed covariations between noncoding RNAs and gene expressions in kidney cancer data.

## Abstract

The advent of multimodal omics data has provided an unprecedented opportunity to systematically investigate underlying biological mechanisms from distinct yet complementary angles. However, the joint analysis of multi-omics data remains challenging because it requires modeling interactions between multiple sets of high-throughput variables. Furthermore, these interaction patterns may vary across different clinical groups, reflecting disease-related biological processes.

We propose a novel approach called Differential Canonical Correlation Analysis (dCCA) to capture differential covariation patterns between two multivariate vectors across clinical groups. Unlike classical Canonical Correlation Analysis, which maximizes the correlation between two multivariate vectors, dCCA aims to maximally recover differentially expressed multivariate-to-multivariate covariation patterns between groups. We have developed computational algorithms and a toolkit to sparsely select paired subsets of variables from two sets of multivariate variables while maximizing the differential covariation. Extensive simulation analyses demonstrate the superior performance of dCCA in selecting variables of interest and recovering differential correlations. We applied dCCA to the Pan-Kidney cohort from the Cancer Genome Atlas Program database and identified differentially expressed covariations between noncoding RNAs and gene expressions.

The R package that implements dCCA is available at https://github.com/hwiyoungstat/dCCA.

## Linked entities

- **Diseases:** kidney cancer (MONDO:0002367)

## Full-text entities

- **Diseases:** Cancer (MESH:D009369)

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11184902/full.md

---
Source: https://tomesphere.com/paper/PMC11184902