# MapReduce Algorithms for Inferring Gene Regulatory Networks from   Time-Series Microarray Data Using an Information-Theoretic Approach

**Authors:** Yasser Abduallah, Turki Turki, Kevin Byron, Zongxuan Du, Miguel, Cervantes-Cervantes, Jason T. L. Wang

arXiv: 1704.06548 · 2017-04-24

## TL;DR

This paper introduces new MapReduce algorithms that leverage cloud computing to efficiently infer gene regulatory networks from time-series microarray data, significantly improving speed and slightly enhancing prediction accuracy.

## Contribution

The paper presents novel MapReduce algorithms for GRN inference that operate efficiently on Hadoop clusters, addressing computational challenges in large-scale gene network analysis.

## Key findings

- MapReduce algorithms are faster than existing tools.
- Achieved slightly better prediction accuracy.
- Effective in analyzing large-scale time-series data.

## Abstract

Gene regulation is a series of processes that control gene expression and its extent. The connections among genes and their regulatory molecules, usually transcription factors, and a descriptive model of such connections, are known as gene regulatory networks (GRNs). Elucidating GRNs is crucial to understand the inner workings of the cell and the complexity of gene interactions. To date, numerous algorithms have been developed to infer gene regulatory networks. However, as the number of identified genes increases and the complexity of their interactions is uncovered, networks and their regulatory mechanisms become cumbersome to test. Furthermore, prodding through experimental results requires an enormous amount of computation, resulting in slow data processing. Therefore, new approaches are needed to expeditiously analyze copious amounts of experimental data resulting from cellular GRNs. To meet this need, cloud computing is promising as reported in the literature. Here we propose new MapReduce algorithms for inferring gene regulatory networks on a Hadoop cluster in a cloud environment. These algorithms employ an information-theoretic approach to infer GRNs using time-series microarray data. Experimental results show that our MapReduce program is much faster than an existing tool while achieving slightly better prediction accuracy than the existing tool.

---
Source: https://tomesphere.com/paper/1704.06548