# Confident difference criterion: a new Bayesian differentially expressed gene selection algorithm with applications

**Authors:** Fang Yu, Ming-Hui Chen, Lynn Kuo, Heather Talbott, John S. Davis

PMC · DOI: 10.1186/s12859-015-0664-3 · 2015-08-07

## TL;DR

This paper introduces a new Bayesian method for identifying genes that are differentially expressed between different conditions, which outperforms existing methods in both simulated and real data.

## Contribution

The confident difference criterion is a novel Bayesian gene selection algorithm that improves detection of differentially expressed genes.

## Key findings

- The confident difference criterion methods outperform existing methods in identifying differentially expressed genes in simulations.
- The proposed methods successfully identified more clinically important genes in a real dataset.
- Theoretical connections are established between the new method and Bayes factor approaches under normal-normal models.

## Abstract

Recently, the Bayesian method becomes more popular for analyzing high dimensional gene expression data as it allows us to borrow information across different genes and provides powerful estimators for evaluating gene expression levels. It is crucial to develop a simple but efficient gene selection algorithm for detecting differentially expressed (DE) genes based on the Bayesian estimators.

In this paper, by extending the two-criterion idea of Chen et al. (Chen M-H, Ibrahim JG, Chi Y-Y. A new class of mixture models for differential gene expression in DNA microarray data. J Stat Plan Inference. 2008;138:387–404), we propose two new gene selection algorithms for general Bayesian models and name these new methods as the confident difference criterion methods. One is based on the standardized differences between two mean expression values among genes; the other adds the differences between two variances to it. The proposed confident difference criterion methods first evaluate the posterior probability of a gene having different gene expressions between competitive samples and then declare a gene to be DE if the posterior probability is large. The theoretical connection between the proposed first method based on the means and the Bayes factor approach proposed by Yu et al. (Yu F, Chen M-H, Kuo L. Detecting differentially expressed genes using alibrated Bayes factors. Statistica Sinica. 2008;18:783–802) is established under the normal-normal-model with equal variances between two samples. The empirical performance of the proposed methods is examined and compared to those of several existing methods via several simulations. The results from these simulation studies show that the proposed confident difference criterion methods outperform the existing methods when comparing gene expressions across different conditions for both microarray studies and sequence-based high-throughput studies. A real dataset is used to further demonstrate the proposed methodology. In the real data application, the confident difference criterion methods successfully identified more clinically important DE genes than the other methods.

The confident difference criterion method proposed in this paper provides a new efficient approach for both microarray studies and sequence-based high-throughput studies to identify differentially expressed genes.

The online version of this article (doi:10.1186/s12859-015-0664-3) contains supplementary material, which is available to authorized users.

## Full-text entities

- **Genes:** TNF (tumor necrosis factor) [NCBI Gene 280943] {aka TNF-a, TNF-alpha, TNFa}, NR3C1 (nuclear receptor subfamily 3 group C member 1) [NCBI Gene 281946] {aka GR-A}, IL1B (interleukin 1 beta) [NCBI Gene 281251], THBS1 (thrombospondin 1) [NCBI Gene 281530] {aka THBS}, CXCL8 (C-X-C motif chemokine ligand 8) [NCBI Gene 280828] {aka IL-8, IL8}, CCL3 (chemokine (C-C motif) ligand 3) [NCBI Gene 282170] {aka CCL3L1}, PGF (placental growth factor) [NCBI Gene 280894] {aka PlGF}, CCL8 [NCBI Gene 788169], PTGS2 (prostaglandin-endoperoxide synthase 2) [NCBI Gene 282023], IFNG (interferon gamma) [NCBI Gene 281237], IL17A (interleukin 17A) [NCBI Gene 282863] {aka IL-17, IL17}, TGFB1 (transforming growth factor beta 1) [NCBI Gene 282089]
- **Diseases:** arthritis (MESH:D001168), hepatic fibrosis (MESH:D008103), DE (MESH:D001039), tumor (MESH:D009369)
- **Chemicals:** prostaglandin (MESH:D011453), prostaglandin F2alpha (MESH:D015237), T (MESH:D014316)
- **Species:** Bos taurus (bovine, species) [taxon 9913]

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC4527130/full.md

---
Source: https://tomesphere.com/paper/PMC4527130