GoBERT: Gene Ontology Graph Informed BERT for Universal Gene Function Prediction
Yuwei Miao, Yuzhi Guo, Hehuan Ma, Jingquan Yan, Feng Jiang, Rui Liao,, Junzhou Huang

TL;DR
GoBERT leverages the Gene Ontology graph and BERT architecture to predict gene functions more accurately by capturing explicit and implicit functional relationships, reducing reliance on costly experiments.
Contribution
This paper introduces GoBERT, a novel BERT-based model that incorporates Gene Ontology graph information for improved gene function prediction, including new pre-training tasks for relation capturing.
Findings
GoBERT outperforms existing methods in gene function prediction accuracy.
Pre-training tasks effectively capture explicit and implicit function relationships.
Model demonstrates ability to predict novel gene functions.
Abstract
Exploring the functions of genes and gene products is crucial to a wide range of fields, including medical research, evolutionary biology, and environmental science. However, discovering new functions largely relies on expensive and exhaustive wet lab experiments. Existing methods of automatic function annotation or prediction mainly focus on protein function prediction with sequence, 3D-structures or protein family information. In this study, we propose to tackle the gene function prediction problem by exploring Gene Ontology graph and annotation with BERT (GoBERT) to decipher the underlying relationships among gene functions. Our proposed novel function prediction task utilizes existing functions as inputs and generalizes the function prediction to gene and gene products. Specifically, two pre-train tasks are designed to jointly train GoBERT to capture both explicit and implicit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsBioinformatics and Genomic Networks · Machine Learning in Bioinformatics · Gene expression and cancer classification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Layer Normalization · Dense Connections · Attention Dropout · WordPiece · Dropout · Linear Layer · Softmax · Linear Warmup With Linear Decay
