A statistical framework for testing functional categories in microarray data
William T. Barry, Andrew B. Nobel, Fred A. Wright

TL;DR
This paper introduces a new statistical framework for testing functional gene categories in microarray data, addressing limitations of existing methods by accounting for gene correlation and proposing bootstrap techniques for improved accuracy.
Contribution
It presents a comprehensive framework that includes new null hypotheses and bootstrap methods, enhancing the validity and power of gene set testing in microarray analysis.
Findings
Class 1 tests are anti-conservative due to ignoring gene correlation.
Class 2 tests are conservative under the new null hypothesis.
Bootstrap methods outperform permutation tests in power and error control.
Abstract
Ready access to emerging databases of gene annotation and functional pathways has shifted assessments of differential expression in DNA microarray studies from single genes to groups of genes with shared biological function. This paper takes a critical look at existing methods for assessing the differential expression of a group of genes (functional category), and provides some suggestions for improved performance. We begin by presenting a general framework, in which the set of genes in a functional category is compared to the complementary set of genes on the array. The framework includes tests for overrepresentation of a category within a list of significant genes, and methods that consider continuous measures of differential expression. Existing tests are divided into two classes. Class 1 tests assume gene-specific measures of differential expression are independent, despite…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
