Weisfeiler and Leman Go Measurement Modeling: Probing the Validity of the WL Test
Arjun Subramonian, Adina Williams, Maximilian Nickel, Yizhou Sun,, Levent Sagun

TL;DR
This paper critically examines the limitations of the $k$-WL test in measuring the expressive power of graph neural networks, revealing misalignments with practitioners' beliefs and proposing benchmark-based evaluation methods.
Contribution
It uncovers the misalignments between $k$-WL and practical understanding of expressive power, and proposes benchmark-based measurement and guiding questions for better evaluation.
Findings
$k$-WL does not guarantee isometry.
$k$-WL can be irrelevant to real-world tasks.
$k$-WL may not promote generalization or trustworthiness.
Abstract
The expressive power of graph neural networks is usually measured by comparing how many pairs of graphs or nodes an architecture can possibly distinguish as non-isomorphic to those distinguishable by the -dimensional Weisfeiler-Leman (-WL) test. In this paper, we uncover misalignments between graph machine learning practitioners' conceptualizations of expressive power and -WL through a systematic analysis of the reliability and validity of -WL. We conduct a survey () of practitioners to surface their conceptualizations of expressive power and their assumptions about -WL. In contrast to practitioners' beliefs, our analysis (which draws from graph theory and benchmark auditing) reveals that -WL does not guarantee isometry, can be irrelevant to real-world graph tasks, and may not promote generalization or trustworthiness. We argue for extensional definitions and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)
