D3-GNN: Dynamic Distributed Dataflow for Streaming Graph Neural Networks
Rustam Guliyev, Aparajita Haldar, Hakan Ferhatosmanoglu

TL;DR
D3-GNN is a novel distributed system for streaming graph neural networks that efficiently handles real-time updates, achieving significant improvements in throughput and latency over existing solutions.
Contribution
It introduces a hybrid-parallel, fault-tolerant streaming GNN system with innovative data management and windowed algorithms for dynamic, large-scale graph processing.
Findings
Achieves 76x throughput improvement over DGL for streaming workloads.
Reduces running times by 10x and message volume by 15x with windowed techniques.
Demonstrates high efficiency and scalability on large-scale graph streams.
Abstract
Graph Neural Network (GNN) models on streaming graphs entail algorithmic challenges to continuously capture its dynamic state, as well as systems challenges to optimize latency, memory, and throughput during both inference and training. We present D3-GNN, the first distributed, hybrid-parallel, streaming GNN system designed to handle real-time graph updates under online query setting. Our system addresses data management, algorithmic, and systems challenges, enabling continuous capturing of the dynamic state of the graph and updating node representations with fault-tolerance and optimal latency, load-balance, and throughput. D3-GNN utilizes streaming GNN aggregators and an unrolled, distributed computation graph architecture to handle cascading graph updates. To counteract data skew and neighborhood explosion issues, we introduce inter-layer and intra-layer windowed forward pass…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
