Theoretical and Empirical Insights into the Origins of Degree Bias in Graph Neural Networks
Arjun Subramonian, Jian Kang, Yizhou Sun

TL;DR
This paper investigates the causes of degree bias in Graph Neural Networks, providing theoretical proofs and empirical validation, and offers strategies to mitigate this bias in real-world networks.
Contribution
It offers a comprehensive analysis of the origins of degree bias in GNNs, combining theoretical proofs with empirical validation, and proposes methods to reduce bias.
Findings
High-degree nodes have lower misclassification probability.
Degree bias is influenced by neighbor homophily and diversity.
Training can mitigate bias with sufficient epochs.
Abstract
Graph Neural Networks (GNNs) often perform better for high-degree nodes than low-degree nodes on node classification tasks. This degree bias can reinforce social marginalization by, e.g., privileging celebrities and other high-degree actors in social networks during social and content recommendation. While researchers have proposed numerous hypotheses for why GNN degree bias occurs, we find via a survey of 38 degree bias papers that these hypotheses are often not rigorously validated, and can even be contradictory. Thus, we provide an analysis of the origins of degree bias in message-passing GNNs with different graph filters. We prove that high-degree test nodes tend to have a lower probability of misclassification regardless of how GNNs are trained. Moreover, we show that degree bias arises from a variety of factors that are associated with a node's degree (e.g., homophily of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications · Advanced Graph Neural Networks · Imbalanced Data Classification Techniques
