Otter-Knowledge: benchmarks of multimodal knowledge graph representation learning from different sources for drug discovery
Hoang Thanh Lam, Marco Luca Sbodio, Marcos Mart\'inez Galindo,, Mykhaylo Zayats, Ra\'ul Fern\'andez-D\'iaz, V\'ictor Valls, Gabriele Picco,, Cesar Berrospi Ramis, Vanessa L\'opez

TL;DR
This paper introduces Otter-Knowledge, a set of multimodal knowledge graphs from diverse sources that enhance drug-target binding affinity predictions by integrating knowledge graphs with molecular and protein representations.
Contribution
It presents a novel multimodal knowledge graph resource and pretrained models that improve drug discovery predictions, surpassing existing methods on standard benchmarks.
Findings
Achieved state-of-the-art results on TDC benchmarks.
Released a large multimodal knowledge graph with over 30 million triples.
Provided pretrained models and source code for drug binding affinity prediction.
Abstract
Recent research on predicting the binding affinity between drug molecules and proteins use representations learned, through unsupervised learning techniques, from large databases of molecule SMILES and protein sequences. While these representations have significantly enhanced the predictions, they are usually based on a limited set of modalities, and they do not exploit available knowledge about existing relations among molecules and proteins. In this study, we demonstrate that by incorporating knowledge graphs from diverse sources and modalities into the sequences or SMILES representation, we can further enrich the representation and achieve state-of-the-art results for drug-target binding affinity prediction in the established Therapeutic Data Commons (TDC) benchmarks. We release a set of multimodal knowledge graphs, integrating data from seven public data sources, and containing over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗ibm-research/otter_dude_distmultmodel· ♡ 3♡ 3
- 🤗ibm-research/otter_dude_classifiermodel· ♡ 3♡ 3
- 🤗ibm-research/otter_primekg_classifiermodel· ♡ 3♡ 3
- 🤗ibm-research/otter_primekg_distmultmodel· ♡ 4♡ 4
- 🤗ibm-research/otter_dude_transemodel· ♡ 2♡ 2
- 🤗ibm-research/otter_primekg_transemodel· ♡ 2♡ 2
- 🤗ibm-research/otter_stitch_classifiermodel· ♡ 2♡ 2
- 🤗ibm-research/otter_stitch_transemodel· ♡ 2♡ 2
- 🤗ibm-research/otter_stitch_distmultmodel· ♡ 4♡ 4
- 🤗ibm-research/otter_ub_mfmodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Chemical Synthesis and Analysis · Protein Structure and Dynamics
