GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts
Shirley Wu, Kaidi Cao, Bruno Ribeiro, James Zou, Jure Leskovec

TL;DR
GraphMETRO introduces a novel Graph Neural Network architecture employing a Mixture-of-Experts approach to effectively model and mitigate complex, natural distributional shifts in graph data, achieving state-of-the-art results.
Contribution
It proposes a new MoE-based GNN architecture with a novel alignment objective to handle diverse distributional shifts in graph data.
Findings
Achieves 67% improvement on WebKB dataset.
Outperforms previous methods on four real-world datasets.
Effectively models complex distributional shifts.
Abstract
Graph data are inherently complex and heterogeneous, leading to a high natural diversity of distributional shifts. However, it remains unclear how to build machine learning architectures that generalize to the complex distributional shifts naturally occurring in the real world. Here, we develop GraphMETRO, a Graph Neural Network architecture that models natural diversity and captures complex distributional shifts. GraphMETRO employs a Mixture-of-Experts (MoE) architecture with a gating model and multiple expert models, where each expert model targets a specific distributional shift to produce a referential representation w.r.t. a reference model, and the gating model identifies shift components. Additionally, we design a novel objective that aligns the representations from different expert models to ensure reliable optimization. GraphMETRO achieves state-of-the-art results on four…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Health, Environment, Cognitive Aging · Context-Aware Activity Recognition Systems
MethodsGraph Neural Network
