Diverse correlation structures in gene expression data and their utility in improving statistical inference

Lev Klebanov; Andrei Yakovlev

arXiv:0712.2130·stat.AP·December 18, 2007

TL;DR

This paper reveals complex correlation patterns in gene expression data that can be exploited to improve statistical inference, especially in testing differential gene expression with greater accuracy and robustness.

Contribution

It demonstrates that correlation structures in microarray data contain valuable information that enhances statistical methods beyond traditional approaches.

Findings

01 Identification of distinct correlation substructures in gene expression data

02 A new method for testing differential expression with improved error control

03 Correlation analysis offers insights with broad biological and statistical implications

Abstract

It is well known that correlations in microarray data represent a serious nuisance deteriorating the performance of gene selection procedures. This paper is intended to demonstrate that the correlation structure of microarray data provides a rich source of useful information. We discuss distinct correlation substructures revealed in microarray gene expression data by an appropriate ordering of genes. These substructures include stochastic proportionality of expression signals in a large percentage of all gene pairs, negative correlations hidden in ordered gene triples, and a long sequence of weakly dependent random variables associated with ordered pairs of genes. The reported striking regularities are of general biological interest and they also have far-reaching implications for theory and practice of statistical methods of microarray data analysis. We illustrate the latter point with…
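To make the idea of stochastic proportionality in gene pairs concrete, here is a minimal Python sketch. It is a toy simulation, not the authors' method or data: it assumes that every gene's signal on an array is multiplied by one shared random factor, and checks how that assumption shows up in pairwise correlations of log-expression. The names `pairwise_correlations`, `array_factor`, and `gene_signal`, along with all distribution parameters, are hypothetical choices made for illustration.

```python
import numpy as np


def pairwise_correlations(expr):
    """Pearson correlations for all distinct gene pairs (rows of expr)."""
    corr = np.corrcoef(expr)
    upper = np.triu_indices_from(corr, k=1)  # each pair counted once
    return corr[upper]


rng = np.random.default_rng(0)
n_genes, n_arrays = 50, 200

# Assumption (toy model): one random multiplicative factor per array,
# shared by every gene measured on that array.
array_factor = rng.lognormal(mean=0.0, sigma=0.5, size=n_arrays)

# Gene-specific variation, independent across genes and arrays.
gene_signal = rng.lognormal(mean=0.0, sigma=0.2, size=(n_genes, n_arrays))

# Observed signal: gene-specific part times the shared array factor.
expr = gene_signal * array_factor

shared = pairwise_correlations(np.log(expr))
independent = pairwise_correlations(np.log(gene_signal))

print(f"pairs: {shared.size}")
print(f"mean correlation with shared factor:    {shared.mean():.2f}")
print(f"mean correlation without shared factor: {independent.mean():.2f}")
```

Under these assumptions the shared factor alone produces strong positive correlation in nearly every gene pair, while the gene-specific signals on their own are close to uncorrelated. This shows only why a proportionality structure would be visible in pairwise statistics; it says nothing about the magnitudes found in real microarray data.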
