TinyGraph: Joint Feature and Node Condensation for Graph Neural Networks
Yezi Liu, Yanning Shen

TL;DR
TinyGraph introduces a joint condensation method for both features and nodes in large-scale graphs, significantly reducing data size while maintaining high GNN performance, thus addressing computational challenges.
Contribution
The paper proposes a novel framework for simultaneous feature and node condensation in graphs, utilizing gradient matching to preserve critical information for GNN training.
Findings
Retains over 97% of test accuracy on Cora and Citeseer datasets.
Reduces nodes by over 97% and features by 90% in experiments.
Demonstrates effective large-scale graph condensation with minimal accuracy loss.
Abstract
Training graph neural networks (GNNs) on large-scale graphs can be challenging due to the high computational expense caused by the massive number of nodes and high-dimensional nodal features. Existing graph condensation studies tackle this problem only by reducing the number of nodes in the graph. However, the resulting condensed graph data can still be cumbersome. Specifically, although the nodes of the Citeseer dataset are reduced to 0.9% (30 nodes) in training, the number of features is 3,703, severely exceeding the training sample magnitude. Faced with this challenge, we study the problem of joint condensation for both features and nodes in large-scale graphs. This task is challenging mainly due to 1) the intertwined nature of the node features and the graph structure calls for the feature condensation solver to be structure-aware; and 2) the difficulty of keeping useful information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks
