# Mining Functional Modules by Multiview-NMF of Phenome-Genome Association

**Authors:** YaoGong Zhang, YingJie Xu, Xin Fan, YuXiang Hong, Jiahui Liu, ZhiCheng, He, YaLou Huang, MaoQiang Xie

arXiv: 1705.03998 · 2017-05-12

## TL;DR

This paper introduces a hierarchical NMF-based method called CMNMF that leverages phenotype ontology structure to more effectively identify biologically meaningful gene modules from gene-phenotype association data.

## Contribution

The novel CMNMF method utilizes hierarchical phenotype ontology information to improve gene module detection compared to traditional expression data-based approaches.

## Key findings

- CMNMF outperforms baseline clustering methods in gene module identification.
- Gene modules identified by CMNMF show higher biological significance.
- The method effectively predicts gene pathway members and protein interactions.

## Abstract

Background: Mining gene modules from genomic data is an important step to detect gene members of pathways or other relations such as protein-protein interactions. In this work, we explore the plausibility of detecting gene modules by factorizing gene-phenotype associations from a phenotype ontology rather than the conventionally used gene expression data. In particular, the hierarchical structure of ontology has not been sufficiently utilized in clustering genes while functionally related genes are consistently associated with phenotypes on the same path in the phenotype ontology. Results: We propose a hierarchal Nonnegative Matrix Factorization (NMF)-based method, called Consistent Multiple Nonnegative Matrix Factorization (CMNMF), to factorize genome-phenome association matrix at two levels of the hierarchical structure in phenotype ontology for mining gene functional modules. CMNMF constrains the gene clusters from the association matrices at two consecutive levels to be consistent since the genes are annotated with both the child phenotype and the parent phenotype in the consecutive levels. CMNMF also restricts the identified phenotype clusters to be densely connected in the phenotype ontology hierarchy. In the experiments on mining functionally related genes from mouse phenotype ontology and human phenotype ontology, CMNMF effectively improved clustering performance over the baseline methods. Gene ontology enrichment analysis was also conducted to reveal interesting gene modules. Conclusions: Utilizing the information in the hierarchical structure of phenotype ontology, CMNMF can identify functional gene modules with more biological significance than the conventional methods. CMNMF could also be a better tool for predicting members of gene pathways and protein-protein interactions. Availability: https://github.com/nkiip/CMNMF

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.03998/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/1705.03998/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/1705.03998/full.md

---
Source: https://tomesphere.com/paper/1705.03998