Validating module network learning algorithms using simulated data
Tom Michoel, Steven Maere, Eric Bonnet, Anagha Joshi, Yvan Saeys, Tim, Van den Bulcke, Koenraad Van Leemput, Piet van Remortel, Martin Kuiper,, Kathleen Marchal, Yves Van de Peer

TL;DR
This paper introduces LeMoNe, a new software for learning gene expression modules using simulated data, featuring a novel hierarchical clustering approach and entropy-based regulator assignment, with advantages in speed and regulator prioritization.
Contribution
The paper presents LeMoNe, a module network learning algorithm with innovative bottom-up Bayesian clustering and entropy measures, validated on synthetic data and compared to existing methods.
Findings
LeMoNe performs comparably to Genomica on simulated data.
LeMoNe is faster for large datasets.
Regulator location and entropy can prioritize functional validation.
Abstract
In recent years, several authors have used probabilistic graphical models to learn expression modules and their regulatory programs from gene expression data. Here, we demonstrate the use of the synthetic data generator SynTReN for the purpose of testing and comparing module network learning algorithms. We introduce a software package for learning module networks, called LeMoNe, which incorporates a novel strategy for learning regulatory programs. Novelties include the use of a bottom-up Bayesian hierarchical clustering to construct the regulatory programs, and the use of a conditional entropy measure to assign regulators to the regulation program nodes. Using SynTReN data, we test the performance of LeMoNe in a completely controlled situation and assess the effect of the methodological changes we made with respect to an existing software package, namely Genomica. Additionally, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
