A Mixture Model to Detect Edges in Sparse Co-expression Graphs

Haim Bar; Seojin Bang

arXiv:1804.01185·stat.AP·June 29, 2021

A Mixture Model to Detect Edges in Sparse Co-expression Graphs

Haim Bar, Seojin Bang

PDF

TL;DR

This paper introduces a novel mixture model for detecting edges in sparse gene co-expression networks, improving detection power and false discovery rate control without assuming specific network structures.

Contribution

It develops a three-component mixture model that effectively identifies gene network edges from co-expression data, outperforming existing methods.

Findings

01

The L2N mixture model outperforms other methods in edge detection power.

02

It effectively controls the false discovery rate.

03

The method makes no assumptions about the true network structure.

Abstract

In the early days of gene expression data, researchers have focused on gene-level analysis, and particularly on finding differentially expressed genes. This usually involved making a simplifying assumption that genes are independent, which made likelihood derivations feasible and allowed for relatively simple implementations. In recent years, the scope has expanded to include pathway and `gene set' analysis in an attempt to understand the relationships between genes. We develop a method to recover a gene network's structure from co-expression data, measured in terms of normalized Pearson's correlation coefficients between gene pairs. We treat these co-expression measurements as weights in the complete graph in which nodes correspond to genes. To decide which edges exist in the gene network, we fit a three-component mixture model such that the observed weights of `null edges' follow a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.