Spectral Greedy Coresets for Graph Neural Networks

Mucong Ding; Yinhan He; Jundong Li; Furong Huang

arXiv:2405.17404·cs.LG·May 28, 2024

Spectral Greedy Coresets for Graph Neural Networks

Mucong Ding, Yinhan He, Jundong Li, Furong Huang

PDF

Open Access 3 Reviews

TL;DR

This paper introduces a spectral greedy coreset method for GNNs that efficiently selects representative ego-graphs to accelerate training on large, complex graphs, outperforming existing methods in speed and accuracy.

Contribution

The paper proposes a novel spectral greedy algorithm for graph coreset selection that effectively handles node interdependence and scales to large graphs, improving GNN training efficiency.

Findings

01

SGGC outperforms other coreset methods significantly.

02

It generalizes well across different GNN architectures.

03

It is much faster than graph condensation techniques.

Abstract

The ubiquity of large-scale graphs in node-classification tasks significantly hinders the real-world applications of Graph Neural Networks (GNNs). Node sampling, graph coarsening, and dataset condensation are effective strategies for enhancing data efficiency. However, owing to the interdependence of graph nodes, coreset selection, which selects subsets of the data examples, has not been successfully applied to speed up GNN training on large graphs, warranting special treatment. This paper studies graph coresets for GNNs and avoids the interdependence issue by selecting ego-graphs (i.e., neighborhood subgraphs around a node) based on their spectral embeddings. We decompose the coreset selection problem for GNNs into two phases: a coarse selection of widely spread ego graphs and a refined selection to diversify their topologies. We design a greedy algorithm that approximately optimizes…

Peer Reviews

Decision·Submitted to ICLR 2024

Reviewer 01Rating 6· marginally above the acceptance thresholdConfidence 4

Strengths

1. This paper provides a new perspective on accelerating the training of GNNs, i.e., the coreset selection. Simply transferring the traditional coreset algorithms into graph data is not an optimal solution. This paper overcomes the complexity dependence between nodes and proposes an effective method. 2. By leveraging the spectral embedding of nodes, which represents the node positions in a graph, SGGC can select the coreset ego-graphs with effective and diverse structures and preserve the cruc

Weaknesses

1. What is the complexity of the proposed method? Does it comparable to other graph reduction methods, e.g., coarsening and condensation. 2. Does the proposed method suitable for inductive setting? For example, in the Cora dataset, some training nodes exists in a small connected component. In this situation, does the spectral embedding still work?

Reviewer 02Rating 8· accept, good paperConfidence 4

Strengths

Pro: - Very well written and easy to follow each step, with intuitive explanation and detailed proof defer to appendix. Adding alias to equation and definitions make it much easier to follow the context without tracing back to equation numbers. - Overall very interesting idea and well executed. Decoupling the sampling step and loss optimization makes a lot of sense and well justified by both theoretical guarantee and extensive experimental results. - Step by step decomposition of final objecti

Weaknesses

Cons: - Small typos in second paragraph of “Graph spectral domain” it should be “1 \geq 1 - 1/2 \lambda_1 \geq … \geq 1 - 1/2 \lambda_n \geq 0. “ - The notation of $\tidle{Z}_i$ is a bit confusing. It is using the diffusion ego-graph $\tidle(G)_i$ for A and X, but has $G_i$ in them. Maybe I am miss understanding the notation here but it’s hard to connect it back with p for the RSD analysis. - Figures y-axis show accuracy value between 0 and 1, yet y label says %. Is this also a typo or the acc

Reviewer 03Rating 3· reject, not good enoughConfidence 4

Strengths

1. The scalability issue of GNNs is an important challenge. 2. I think this is the first work to study coresets for GNNs.

Weaknesses

1. I think the formulation of the coreset problem proposed in the paper is not practical. Basically, it only aims to reduce the size of the training set, but at training time, the ego-graphs of them still need to be loaded, which could still be huge because of the neighbor explosion phenomenon. This is also the reason why plain mini-batch training is not applicable in the first place. The coreset problem proposed in this paper doesn't solve the memory usage challenge faced by mini-batch training

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Neural Networks and Applications · Graph Theory and Algorithms

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Coresets