No Metric to Rule Them All: Toward Principled Evaluations of Graph-Learning Datasets
Corinna Coupette, Jeremy Wayland, Emily Simons, Bastian Rieck

TL;DR
This paper introduces a principled framework for evaluating graph-learning datasets by analyzing their structure and features through mode-perturbation, aiming to improve benchmarking practices in graph learning.
Contribution
It proposes a novel mode-perturbation framework called Rings and introduces two measures, performance separability and mode complementarity, for systematic dataset evaluation.
Findings
Framework effectively assesses dataset quality through ablation studies.
Experimental results highlight the importance of dataset properties for benchmarking.
Recommendations improve the robustness of graph-learning evaluations.
Abstract
Benchmark datasets have proved pivotal to the success of graph learning, and good benchmark datasets are crucial to guide the development of the field. Recent research has highlighted problems with graph-learning datasets and benchmarking practices -- revealing, for example, that methods which ignore the graph structure can outperform graph-based approaches. Such findings raise two questions: (1) What makes a good graph-learning dataset, and (2) how can we evaluate dataset quality in graph learning? Our work addresses these questions. As the classic evaluation setup uses datasets to evaluate models, it does not apply to dataset evaluation. Hence, we start from first principles. Observing that graph-learning datasets uniquely combine two modes -- graph structure and node features --, we introduce Rings, a flexible and extensible mode-perturbation framework to assess the quality of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks
