TL;DR
Bio-JOIE is a novel joint embedding model that integrates gene ontology and protein-protein interaction data to predict interactions, especially for SARS-CoV-2, outperforming existing methods and aiding in understanding the virus's molecular mechanisms.
Contribution
The paper introduces Bio-JOIE, a transferred multi-relational embedding model that effectively combines biological knowledge bases for improved PPI prediction and virus research.
Findings
Bio-JOIE outperforms state-of-the-art methods in PPI prediction.
It accurately identifies SARS-CoV-2 and human protein interactions.
The model enables protein clustering into functional families.
Abstract
The widespread of Coronavirus has led to a worldwide pandemic with a high mortality rate. Currently, the knowledge accumulated from different studies about this virus is very limited. Leveraging a wide-range of biological knowledge, such as gene ontology and protein-protein interaction (PPI) networks from other closely related species presents a vital approach to infer the molecular impact of a new species. In this paper, we propose the transferred multi-relational embedding model Bio-JOIE to capture the knowledge of gene ontology and PPI networks, which demonstrates superb capability in modeling the SARS-CoV-2-human protein interactions. Bio-JOIE jointly trains two model components. The knowledge model encodes the relational facts from the protein and GO domains into separated embedding spaces, using a hierarchy-aware encoding technique employed for the GO terms. On top of that, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
