Graph Sparsification in the Semi-streaming Model
Kook Jin Ahn, Sudipto Guha

TL;DR
This paper presents a one-pass semi-streaming algorithm for graph sparsification that approximates cuts within a (1+ε) factor using near-linear space, advancing the understanding of graph processing in limited memory models.
Contribution
It introduces the first single-pass semi-streaming algorithm for graph sparsification that guarantees cut approximations within a (1+ε) factor, with near-linear space complexity.
Findings
Provides a $ ilde{O}(n/ ext{ε}^2)$ space algorithm for graph sparsification.
Shows a lower bound of $ ext{Ω}(n ext{log} rac{1}{ ext{ε}})$ space for approximating min-cut in one pass.
Demonstrates that linear space is insufficient for certain graph approximation tasks in streaming models.
Abstract
Analyzing massive data sets has been one of the key motivations for studying streaming algorithms. In recent years, there has been significant progress in analysing distributions in a streaming setting, but the progress on graph problems has been limited. A main reason for this has been the existence of linear space lower bounds for even simple problems such as determining the connectedness of a graph. However, in many new scenarios that arise from social and other interaction networks, the number of vertices is significantly less than the number of edges. This has led to the formulation of the semi-streaming model where we assume that the space is (near) linear in the number of vertices (but not necessarily the edges), and the edges appear in an arbitrary (and possibly adversarial) order. In this paper we focus on graph sparsification, which is one of the major building blocks in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Caching and Content Delivery · Internet Traffic Analysis and Secure E-voting
