A Unified Benchmark for Evaluating Knowledge Graph Construction Methods and Graph Neural Networks
Othmane Kabal, Mounira Harzallah, Fabrice Guillet, Hideaki Takeda, Ryutaro Ichise

TL;DR
This paper introduces a comprehensive benchmark for evaluating the quality of knowledge graph construction methods and the robustness of Graph Neural Networks on noisy, text-derived graphs, with a focus on biomedical data.
Contribution
It presents a unified, reproducible benchmark with multiple graph construction methods and a standardized evaluation framework for GNN performance and robustness assessment.
Findings
Benchmark includes two automatic extraction methods and a high-quality reference graph.
Systematic evaluation of GNN robustness on noisy, text-derived graphs.
Facilitates comparison of graph construction methods and GNN performance.
Abstract
Knowledge graphs automatically constructed from text are increasingly used in real-world applications. However, their inherent noise, fragmentation, and semantic inconsistencies significantly affect the performance of Graph Neural Networks (GNNs) on downstream tasks. Assessing their performance and robustness remains difficult, as it is often unclear whether observed results stem from the learning model or from the quality of the constructed graph itself. In this work, we introduce a dual-purpose benchmark designed to jointly evaluate (i) the performance of GNNs on noisy, text-derived graphs and (ii) the effectiveness of graph construction methods on a downstream task. The benchmark is built in the biomedical domain from a single textual corpus and includes two automatically constructed graphs generated using different extraction methods, alongside a high-quality reference graph curated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
