Time and Memory Efficient Parallel Algorithm for Structural Graph Summaries and two Extensions to Incremental Summarization and $k$-Bisimulation for Long $k$-Chaining
Till Blume, Jannik Rau, David Richerby, Ansgar Scherp

TL;DR
This paper presents a scalable parallel algorithm for graph summarization, including incremental updates and $k$-bisimulation extensions, demonstrating high efficiency and low memory overhead on large real-world datasets.
Contribution
The paper introduces a flexible parallel algorithm supporting various graph summaries, with novel incremental and hash-based $k$-bisimulation extensions, validated through extensive empirical evaluation.
Findings
Incremental algorithm outperforms batch computation in speed.
Supports graphs with over 100 million edges within minutes.
Requires only 8% additional memory for incremental updates.
Abstract
We developed a flexible parallel algorithm for graph summarization based on vertex-centric programming and parameterized message passing. The base algorithm supports infinitely many structural graph summary models defined in a formal language. An extension of the parallel base algorithm allows incremental graph summarization. In this paper, we prove that the incremental algorithm is correct and show that updates are performed in time , where is the number of additions, deletions, and modifications to the input graph, the maximum degree, and is the maximum distance in the subgraphs considered. Although the iterative algorithm supports values of , it requires nested data structures for the message passing that are memory-inefficient. Thus, we extended the base summarization algorithm by a hash-based messaging mechanism to support a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Graph Neural Networks · Data Quality and Management
