Demystifying Drug Repurposing Domain Comprehension with Knowledge Graph Embedding
Edoardo Ramalli, Alberto Parravicini, Guido Walter Di Donato, Mirko, Salaris, C\'eline Hudelot, Marco Domenico Santambrogio

TL;DR
This paper presents a methodology to interpret and improve knowledge graph embedding models for drug repurposing, reducing computational costs while maintaining or enhancing prediction accuracy.
Contribution
It introduces a structured approach to understand model behavior, identify key graph elements, and optimize embeddings for drug repurposing tasks.
Findings
Reduced training set by 11.05% with only 2% accuracy loss
Decreased embedding space by 31.87% while maintaining performance
Increased accuracy by 60% on ogbl-biokg with minimal new data
Abstract
Drug repurposing is more relevant than ever due to drug development's rising costs and the need to respond to emerging diseases quickly. Knowledge graph embedding enables drug repurposing using heterogeneous data sources combined with state-of-the-art machine learning models to predict new drug-disease links in the knowledge graph. As in many machine learning applications, significant work is still required to understand the predictive models' behavior. We propose a structured methodology to understand better machine learning models' results for drug repurposing, suggesting key elements of the knowledge graph to improve predictions while saving computational resources. We reduce the training set of 11.05% and the embedding space by 31.87%, with only a 2% accuracy reduction, and increase accuracy by 60% on the open ogbl-biokg graph adding only 1.53% new triples.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Biomedical Text Mining and Ontologies · Bioinformatics and Genomic Networks
