Informed, but Not Always Improved: Challenging the Benefit of Background Knowledge in GNNs
Kutalm{\i}\c{s} Co\c{s}kun, Ivo Kavisanczki, Amin Mirzaei, Tom Siegl, Bjarne C. Hiller, Stefan L\"udtke, Martin Becker

TL;DR
This paper critically examines the assumed benefits of background knowledge graphs in GNNs for biomedical tasks, revealing that their impact is often negligible and highlighting the importance of model-knowledge alignment.
Contribution
It introduces an evaluation framework to assess the true contribution of background knowledge in GNNs, demonstrating that performance gains are not guaranteed and depend on proper alignment.
Findings
GNNs with background knowledge often perform no better than simple models.
Perturbations in background knowledge have little effect on GNN performance.
Proper alignment of GNN architecture and background knowledge is crucial for potential improvements.
Abstract
In complex and low-data domains such as biomedical research, incorporating background knowledge (BK) graphs, such as protein-protein interaction (PPI) networks, into graph-based machine learning pipelines is a promising research direction. However, while BK is often assumed to improve model performance, its actual contribution and the impact of imperfect knowledge remain poorly understood. In this work, we investigate the role of BK in an important real-world task: cancer subtype classification. Surprisingly, we find that (i) state-of-the-art GNNs using BK perform no better than uninformed models like linear regression, and (ii) their performance remains largely unchanged even when the BK graph is heavily perturbed. To understand these unexpected results, we introduce an evaluation framework, which employs (i) a synthetic setting where the BK is clearly informative and (ii) a set of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Advanced Graph Neural Networks · Machine Learning in Bioinformatics
MethodsSparse Evolutionary Training
