High Performance Computing of Gene Regulatory Networks using a Message-Passing Model
Kimberly Glass, John Quackenbush, Jeremy Kepner

TL;DR
This paper presents a highly efficient MATLAB/Octave implementation of the PANDA algorithm for gene regulatory network reconstruction, significantly improving speed and scalability for large biological datasets.
Contribution
The authors recast PANDA's similarity equations as matrix operations, creating a shorter, more readable, and faster MATLAB/Octave version that enhances scalability for big data applications.
Findings
M-code implementation is 20-80 times faster than C-code.
The new implementation is shorter and more modifiable.
Efficiency increases with larger network models.
Abstract
Gene regulatory network reconstruction is a fundamental problem in computational biology. We recently developed an algorithm, called PANDA (Passing Attributes Between Networks for Data Assimilation), that integrates multiple sources of 'omics data and estimates regulatory network models. This approach was initially implemented in the C++ programming language and has since been applied to a number of biological systems. In our current research we are beginning to expand the algorithm to incorporate larger and most diverse data-sets, to reconstruct networks that contain increasing numbers of elements, and to build not only single network models, but sets of networks. In order to accomplish these "Big Data" applications, it has become critical that we increase the computational efficiency of the PANDA implementation. In this paper we show how to recast PANDA's similarity equations as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
