SSumM: Sparse Summarization of Massive Graphs

Kyuhan Lee; Hyeonsoo Jo; Jihoon Ko; Sungsu Lim; Kijung Shin

arXiv:2006.01060·cs.DB·February 23, 2021

SSumM: Sparse Summarization of Massive Graphs

Kyuhan Lee, Hyeonsoo Jo, Jihoon Ko, Sungsu Lim, Kijung Shin

PDF

2 Repos

TL;DR

SSumM is a scalable graph summarization method that produces sparse, concise summaries with minimal information loss, significantly outperforming existing techniques in compression rate, accuracy, and scalability.

Contribution

The paper introduces SSumM, a novel algorithm that combines node merging and sparsification based on the minimum description length principle, enabling efficient and effective large-scale graph summarization.

Findings

01

Up to 11.2X smaller summary graphs with similar error

02

Achieves 4.2X lower reconstruction error for similar size

03

Summarizes 26X larger graphs with linear scalability

Abstract

Given a graph G and the desired size k in bits, how can we summarize G within k bits, while minimizing the information loss? Large-scale graphs have become omnipresent, posing considerable computational challenges. Analyzing such large graphs can be fast and easy if they are compressed sufficiently to fit in main memory or even cache. Graph summarization, which yields a coarse-grained summary graph with merged nodes, stands out with several advantages among graph compression techniques. Thus, a number of algorithms have been developed for obtaining a concise summary graph with little information loss or equivalently small reconstruction error. However, the existing methods focus solely on reducing the number of nodes, and they often yield dense summary graphs, failing to achieve better compression rates. Moreover, due to their limited scalability, they can be applied only to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.