Scalable Graph Condensation with Evolving Capabilities
Shengbo Gong, Mohammad Hashemi, Juntong Ni, Carl Yang, Wei Jin

TL;DR
This paper introduces GECC, a scalable and efficient graph condensation framework capable of handling evolving graph data streams, significantly improving speed and performance over existing static methods.
Contribution
GECC is a novel continual graph condensation method that efficiently updates condensed graphs for dynamic data without retraining, supported by strong theoretical and empirical results.
Findings
GECC outperforms state-of-the-art methods in accuracy.
GECC achieves approximately 1000× speedup on large datasets.
GECC effectively handles large-scale, evolving graph data.
Abstract
The rapid growth of graph data creates significant scalability challenges as most graph algorithms scale quadratically with size. To mitigate these issues, Graph Condensation (GC) methods have been proposed to learn a small graph from a larger one, accelerating downstream tasks. However, existing approaches critically assume a static training set, which conflicts with the inherently dynamic and evolving nature of real-world graph data. This work introduces a novel framework for continual graph condensation, enabling efficient updates to the distilled graph that handle data streams without requiring costly retraining. This limitation leads to inefficiencies when condensing growing training sets. In this paper, we introduce GECC (\underline{G}raph \underline{E}volving \underline{C}lustering \underline{C}ondensation), a scalable graph condensation method designed to handle large-scale and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel-Driven Software Engineering Techniques
MethodsSparse Evolutionary Training
