Ripple: Scalable Incremental GNN Inferencing on Large Streaming Graphs

Pranjal Naman; Yogesh Simmhan

arXiv:2505.12112·cs.DC·December 9, 2025

Ripple: Scalable Incremental GNN Inferencing on Large Streaming Graphs

Pranjal Naman, Yogesh Simmhan

PDF

TL;DR

Ripple is a scalable framework for real-time incremental GNN inference on large, dynamic graphs, significantly reducing computation and communication costs while maintaining accuracy.

Contribution

Ripple introduces a generalized incremental programming model for efficient, exact GNN update propagation in dynamic graphs, extending to distributed systems.

Findings

01

Achieves up to 28000 updates/sec on sparse graphs

02

Provides 30x throughput improvement over baselines

03

Latency of 0.1ms to 1s suitable for near-realtime applications

Abstract

Most real-world graphs are dynamic in nature, with continuous and rapid updates to the graph topology, and vertex and edge properties. Such frequent updates pose significant challenges for inferencing over Graph Neural Networks (GNNs). Current approaches that perform vertex-wise and layer-wise inferencing are impractical for dynamic graphs as they cause redundant computations, expand to large neighborhoods, and incur high communication costs for distributed setups, resulting in slow update propagation that often exceeds real-time latency requirements. This motivates the need for streaming GNN inference frameworks that are efficient and accurate over large, dynamic graphs. We propose Ripple, a framework that performs fast incremental updates of embeddings arising due to updates to the graph topology or vertex features. Ripple provides a generalized incremental programming model,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.