Optimization and Generalization Analysis of Transduction through Gradient Boosting and Application to Multi-scale Graph Neural Networks
Kenta Oono, Taiji Suzuki

TL;DR
This paper provides a theoretical analysis of multi-scale GNNs using boosting theory, demonstrating their convergence and generalization guarantees, and applies boosting algorithms to improve node prediction tasks.
Contribution
It offers the first theoretical explanation for multi-scale GNNs' effectiveness against over-smoothing and applies boosting to enhance GNN training.
Findings
Convergence of training error under weak learning conditions.
Test error bounds decrease with node aggregations.
Boosting improves GNN performance on real-world tasks.
Abstract
It is known that the current graph neural networks (GNNs) are difficult to make themselves deep due to the problem known as over-smoothing. Multi-scale GNNs are a promising approach for mitigating the over-smoothing problem. However, there is little explanation of why it works empirically from the viewpoint of learning theory. In this study, we derive the optimization and generalization guarantees of transductive learning algorithms that include multi-scale GNNs. Using the boosting theory, we prove the convergence of the training error under weak learning-type conditions. By combining it with generalization gap bounds in terms of transductive Rademacher complexity, we show that a test error bound of a specific type of multi-scale GNNs that decreases corresponding to the number of node aggregations under some conditions. Our results offer theoretical explanations for the effectiveness of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Graph Neural Networks · Machine Learning and Algorithms · Graph Theory and Algorithms
