# DeepGO: Predicting protein functions from sequence and interactions   using a deep ontology-aware classifier

**Authors:** Maxat Kulmanov, Mohammed Asif Khan, Robert Hoehndorf

arXiv: 1705.05919 · 2017-09-28

## TL;DR

DeepGO employs a deep learning model that integrates protein sequences and interaction networks to predict protein functions within the Gene Ontology framework, significantly outperforming traditional methods like BLAST.

## Contribution

This paper introduces a novel deep ontology-aware classifier that leverages GO structure and multi-modal data for improved protein function prediction.

## Key findings

- Outperforms baseline methods such as BLAST
- Significant improvement in predicting cellular locations
- Effective integration of sequence and interaction data

## Abstract

A large number of protein sequences are becoming available through the application of novel high-throughput sequencing technologies. Experimental functional characterization of these proteins is time-consuming and expensive, and is often only done rigorously for few selected model organisms. Computational function prediction approaches have been suggested to fill this gap. The functions of proteins are classified using the Gene Ontology (GO), which contains over 40,000 classes. Additionally, proteins have multiple functions, making function prediction a large-scale, multi-class, multi-label problem.   We have developed a novel method to predict protein function from sequence. We use deep learning to learn features from protein sequences as well as a cross-species protein-protein interaction network. Our approach specifically outputs information in the structure of the GO and utilizes the dependencies between GO classes as background information to construct a deep learning model. We evaluate our method using the standards established by the Computational Assessment of Function Annotation (CAFA) and demonstrate a significant improvement over baseline methods such as BLAST, with significant improvement for predicting cellular locations.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.05919/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1705.05919/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/1705.05919/full.md

---
Source: https://tomesphere.com/paper/1705.05919