Training Diverse Graph Experts for Ensembles: A Systematic Empirical Study
Gangda Deng, Yuxin Yang, \"Omer Faruk Akg\"ul, Hanqing Zeng, Yinglong Xia, Rajgopal Kannan, Viktor Prasanna

TL;DR
This paper systematically evaluates techniques for diversifying GNN experts in ensembles, demonstrating how different strategies impact performance and providing insights for designing effective MoE frameworks on graph data.
Contribution
It is the first comprehensive empirical study of expert diversification methods for GNN ensembles, analyzing 20 strategies across numerous benchmarks.
Findings
Diversification techniques vary in effectiveness for GNN ensembles.
Certain strategies significantly improve ensemble performance.
Insights into training maximally diverse GNN experts are provided.
Abstract
Graph Neural Networks (GNNs) have become essential tools for learning on relational data, yet the performance of a single GNN is often limited by the heterogeneity present in real-world graphs. Recent advances in Mixture-of-Experts (MoE) frameworks demonstrate that assembling multiple, explicitly diverse GNNs with distinct generalization patterns can significantly improve performance. In this work, we present the first systematic empirical study of expert-level diversification techniques for GNN ensembles. Evaluating 20 diversification strategies -- including random re-initialization, hyperparameter tuning, architectural variation, directionality modeling, and training data partitioning -- across 14 node classification benchmarks, we construct and analyze over 200 ensemble variants. Our comprehensive evaluation examines each technique in terms of expert diversity, complementarity, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning · Graph Theory and Algorithms
