TL;DR
SSumM is a scalable graph summarization method that produces sparse, concise summaries with minimal information loss, significantly outperforming existing techniques in compression rate, accuracy, and scalability.
Contribution
The paper introduces SSumM, a novel algorithm that combines node merging and sparsification based on the minimum description length principle, enabling efficient and effective large-scale graph summarization.
Findings
Up to 11.2X smaller summary graphs with similar error
Achieves 4.2X lower reconstruction error for similar size
Summarizes 26X larger graphs with linear scalability
Abstract
Given a graph G and the desired size k in bits, how can we summarize G within k bits, while minimizing the information loss? Large-scale graphs have become omnipresent, posing considerable computational challenges. Analyzing such large graphs can be fast and easy if they are compressed sufficiently to fit in main memory or even cache. Graph summarization, which yields a coarse-grained summary graph with merged nodes, stands out with several advantages among graph compression techniques. Thus, a number of algorithms have been developed for obtaining a concise summary graph with little information loss or equivalently small reconstruction error. However, the existing methods focus solely on reducing the number of nodes, and they often yield dense summary graphs, failing to achieve better compression rates. Moreover, due to their limited scalability, they can be applied only to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
