Multi-task Deep Neural Networks in Automated Protein Function Prediction
Ahmet Sureyya Rifaioglu, Tunca Do\u{g}an, Maria Jesus Martin, Rengul, Cetin-Atalay, Mehmet Volkan Atalay

TL;DR
This paper presents a hierarchical multi-task deep neural network architecture for protein function prediction using Gene Ontology terms, demonstrating improved performance and exploring factors affecting accuracy.
Contribution
The study introduces a novel multi-task deep learning model for protein function prediction and analyzes the impact of dataset size, GO term hierarchy, and noisy data inclusion.
Findings
Performance improves with larger training datasets.
No correlation between GO term depth and prediction accuracy.
Including noisy annotations can enhance performance for low-performing GO terms.
Abstract
In recent years, deep learning algorithms have outperformed the state-of-the art methods in several areas thanks to the efficient methods for training and for preventing overfitting, advancement in computer hardware, the availability of vast amount data. The high performance of multi-task deep neural networks in drug discovery has attracted the attention to deep learning algorithms in bioinformatics area. Here, we proposed a hierarchical multi-task deep neural network architecture based on Gene Ontology (GO) terms as a solution to protein function prediction problem and investigated various aspects of the proposed architecture by performing several experiments. First, we showed that there is a positive correlation between performance of the system and the size of training datasets. Second, we investigated whether the level of GO terms on GO hierarchy related to their performance. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Bioinformatics · Bioinformatics and Genomic Networks
