Learning Directed-Acyclic-Graphs from Large-Scale Genomics Data
Fabio Nikolay, Marius Pesavento, George Kritikos, Nassos Typas

TL;DR
This paper introduces GENIE, a novel linear integer optimization method for learning gene interaction DAGs from noisy double knockout data, enhanced by GI-profile data and a scalable sequential approach, outperforming traditional methods.
Contribution
The paper presents a new optimization-based framework, GENIE, for accurately inferring genetic interaction networks from large-scale noisy data, with extensions for improved detection and scalability.
Findings
GENIE outperforms conventional techniques in simulations.
Incorporating GI-profile data enhances detection accuracy.
The sequential scalability method enables analysis of large gene sets.
Abstract
In this paper we consider the problem of learning the genetic-interaction-map, i.e., the topology of a directed acyclic graph (DAG) of genetic interactions from noisy double knockout (DK) data. Based on a set of well established biological interaction models we detect and classify the interactions between genes. We propose a novel linear integer optimization program called the Genetic-Interactions-Detector (GENIE) to identify the complex biological dependencies among genes and to compute the DAG topology that matches the DK measurements best. Furthermore, we extend the GENIE-program by incorporating genetic-interactions-profile (GI-profile) data to further enhance the detection performance. In addition, we propose a sequential scalability technique for large sets of genes under study, in order to provide statistically stressable results for real measurement data. Finally, we show via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Genomics and Chromatin Dynamics
