Informed, but Not Always Improved: Challenging the Benefit of Background Knowledge in GNNs

Kutalm{\i}\c{s} Co\c{s}kun; Ivo Kavisanczki; Amin Mirzaei; Tom Siegl; Bjarne C. Hiller; Stefan L\"udtke; Martin Becker

arXiv:2505.11023·cs.LG·September 23, 2025

Informed, but Not Always Improved: Challenging the Benefit of Background Knowledge in GNNs

Kutalm{\i}\c{s} Co\c{s}kun, Ivo Kavisanczki, Amin Mirzaei, Tom Siegl, Bjarne C. Hiller, Stefan L\"udtke, Martin Becker

PDF

Open Access

TL;DR

This paper critically examines the assumed benefits of background knowledge graphs in GNNs for biomedical tasks, revealing that their impact is often negligible and highlighting the importance of model-knowledge alignment.

Contribution

It introduces an evaluation framework to assess the true contribution of background knowledge in GNNs, demonstrating that performance gains are not guaranteed and depend on proper alignment.

Findings

01

GNNs with background knowledge often perform no better than simple models.

02

Perturbations in background knowledge have little effect on GNN performance.

03

Proper alignment of GNN architecture and background knowledge is crucial for potential improvements.

Abstract

In complex and low-data domains such as biomedical research, incorporating background knowledge (BK) graphs, such as protein-protein interaction (PPI) networks, into graph-based machine learning pipelines is a promising research direction. However, while BK is often assumed to improve model performance, its actual contribution and the impact of imperfect knowledge remain poorly understood. In this work, we investigate the role of BK in an important real-world task: cancer subtype classification. Surprisingly, we find that (i) state-of-the-art GNNs using BK perform no better than uninformed models like linear regression, and (ii) their performance remains largely unchanged even when the BK graph is heavily perturbed. To understand these unexpected results, we introduce an evaluation framework, which employs (i) a synthetic setting where the BK is clearly informative and (ii) a set of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBioinformatics and Genomic Networks · Advanced Graph Neural Networks · Machine Learning in Bioinformatics

MethodsSparse Evolutionary Training