Distributed representations of graphs for drug pair scoring
Paul Scherer, Pietro Li\`o, Mateja Jamnik

TL;DR
This paper investigates the use of distributed graph representations in drug pair scoring, demonstrating their effectiveness across various models and tasks, and releases these embeddings publicly.
Contribution
It introduces a methodology for learning and integrating distributed graph representations into drug pair scoring models, improving their performance.
Findings
Embeddings improve performance across multiple models and tasks.
Distributed representations are effective despite dataset growth and updates.
Public release of drug embeddings for several datasets.
Abstract
In this paper we study the practicality and usefulness of incorporating distributed representations of graphs into models within the context of drug pair scoring. We argue that the real world growth and update cycles of drug pair scoring datasets subvert the limitations of transductive learning associated with distributed representations. Furthermore, we argue that the vocabulary of discrete substructure patterns induced over drug sets is not dramatically large due to the limited set of atom types and constraints on bonding patterns enforced by chemistry. Under this pretext, we explore the effectiveness of distributed representations of the molecular graphs of drugs in drug pair scoring tasks such as drug synergy, polypharmacy, and drug-drug interaction prediction. To achieve this, we present a methodology for learning and incorporating distributed representations of graphs within a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Metabolomics and Mass Spectrometry Studies
