TL;DR
OPA2Vec is a novel method that combines formal axioms and annotation metadata from biomedical ontologies to generate vector representations, improving similarity-based predictions like protein interactions and gene-disease associations.
Contribution
This paper introduces OPA2Vec, a new approach that integrates ontology axioms and annotations using Word2Vec to enhance biomedical entity similarity measures.
Findings
Improved protein-protein interaction prediction accuracy.
Enhanced gene-disease association prediction.
Versatile application to various biomedical ontologies.
Abstract
Motivation: Ontologies are widely used in biology for data annotation, integration, and analysis. In addition to formally structured axioms, ontologies contain meta-data in the form of annotation axioms which provide valuable pieces of information that characterize ontology classes. Annotations commonly used in ontologies include class labels, descriptions, or synonyms. Despite being a rich source of semantic information, the ontology meta-data are generally unexploited by ontology-based analysis methods such as semantic similarity measures. Results: We propose a novel method, OPA2Vec, to generate vector representations of biological entities in ontologies by combining formal ontology axioms and annotation axioms from the ontology meta-data. We apply a Word2Vec model that has been pre-trained on PubMed abstracts to produce feature vectors from our collected data. We validate our method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
