Investigating the Interplay between Features and Structures in Graph Learning
Daniele Castellana, Federico Errica

TL;DR
This paper empirically explores how the relationship between node features and labels affects graph learning, challenging existing metrics and providing new insights through synthetic experiments with various models.
Contribution
It introduces formal generative processes for node classification, evaluates feature influence with a new metric, and demonstrates limitations of existing metrics in relaxed assumptions.
Findings
Existing metrics are inadequate when feature-label correlation is weak.
Models can perform well on heterophilic graphs despite low feature informativeness.
Synthetic tasks reveal nuanced effects of features and structure on model performance.
Abstract
In the past, the dichotomy between homophily and heterophily has inspired research contributions toward a better understanding of Deep Graph Networks' inductive bias. In particular, it was believed that homophily strongly correlates with better node classification predictions of message-passing methods. More recently, however, researchers pointed out that such dichotomy is too simplistic as we can construct node classification tasks where graphs are completely heterophilic but the performances remain high. Most of these works have also proposed new quantitative metrics to understand when a graph structure is useful, which implicitly or explicitly assume the correlation between node features and target labels. Our work empirically investigates what happens when this strong assumption does not hold, by formalising two generative processes for node classification tasks that allow us to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Complex Network Analysis Techniques · Bioinformatics and Genomic Networks
