GSR-GNN: Training Acceleration and Memory-Saving Framework of Deep GNNs on Circuit Graph
Yuebo Luo, Shiyang Li, Yifei Feng, Vishal Kancharla, Shaoyi Huang, Caiwen Ding

TL;DR
GSR-GNN is a novel framework that enables training deep GNNs on circuit graphs efficiently, significantly reducing memory and computation while maintaining high accuracy.
Contribution
The paper introduces GSR-GNN, combining reversible modules and sparse operators to enable hundreds of layers of deep GNNs with reduced resource consumption.
Findings
Achieves up to 87.2% peak memory reduction.
Over 30x training speedup.
Negligible degradation in quality metrics.
Abstract
Graph Neural Networks (GNNs) show strong promise for circuit analysis, but scaling to modern large-scale circuit graphs is limited by GPU memory and training cost, especially for deep models. We revisit deep GNNs for circuit graphs and show that, when trainable, they significantly outperform shallow architectures, motivating an efficient, domain-specific training framework. We propose Grouped-Sparse-Reversible GNN (GSR-GNN), which enables training GNNs with up to hundreds of layers while reducing both compute and memory overhead. GSR-GNN integrates reversible residual modules with a group-wise sparse nonlinear operator that compresses node embeddings without sacrificing task-relevant information, and employs an optimized execution pipeline to eliminate fragmented activation storage and reduce data movement. On sampled circuit graphs, GSR-GNN achieves up to 87.2\% peak memory reduction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
