Nearly Space-Optimal Graph and Hypergraph Sparsification in Insertion-Only Data Streams
Vincent Cohen-Addad, David P. Woodruff, Shenghao Xie, Samson Zhou

TL;DR
This paper introduces nearly space-optimal algorithms for graph and hypergraph sparsification in insertion-only data streams, achieving approximation guarantees close to offline algorithms with significantly reduced space complexity.
Contribution
It presents new streaming algorithms for hypergraph and graph sparsification that match offline sample complexity up to poly-logarithmic factors, and introduces space-efficient algorithms for min-cut and adversarial robustness.
Findings
Achieves $(1+\varepsilon)$-approximation for hypergraph sparsification with near-optimal space.
Improves space bounds for graph sparsification by a factor of $\log n$.
Provides space-efficient algorithms for min-cut approximation and adversarially robust hypergraph sparsification.
Abstract
We study the problem of graph and hypergraph sparsification in insertion-only data streams. The input is a hypergraph with nodes, hyperedges, and rank , and the goal is to compute a hypergraph that preserves the energy of each vector in , up to a small multiplicative error. In this paper, we give a streaming algorithm that achieves a -approximation, using bits of space, matching the sample complexity of the best known offline algorithm up to factors. Our approach also provides a streaming algorithm for graph sparsification that achieves a -approximation, using bits of space, improving the current bound by factors.…
Peer Reviews
Decision·ICLR 2026 Poster
The paper improves the bound for the insertion-only streaming model by a multiplicative factor that can grow up to $n$, the number of vertices. They also propose better bounds for several other models, including resolving an open question of Soma et al. (2024) in the online hypergraph case.
The paper is structured poorly and difficult to read. The main results are not presented clearly. The comparison with other works (Appendix A, Fig. 3) should have been presented in the main paper. The absence of a technical overview or a clear presentation of the main technical challenges and proposed solutions makes it difficult to comprehend the core ideas of the paper. The comparison with Khanna et al. 2025a seems a bit unfair, given that their algorithm is for the dynamic setting, althoug
Spectral sparsification of graphs and hypergraphs are important problems from both a theoretical and practical perspective and studying these problems in streaming and online settings seem quite natural. It is impressive that the paper gets improved algorithms for so many of these problems. Especially Theorem 1.1 (nearly tight bounds) and 1.3 (removing dependencies on $\log m$ and $r$) seem nice. The paper provides experiments that demonstrate that a version of their algorithm performs quite fa
The paper might be difficult to read as the techniques are quite technical. It was somewhat difficult for me to assess the novelty of the ideas due to this technical opacity. It is however quite possible that people stronger in linear algebra and spectral methods find it more understandable. Another weakness is that the polynomial update time in $n$ might be a little disappointing, but I don't think this is a big issue.
The problem is important, and the paper delivers near–space-optimal $(1\pm\varepsilon)$ sparsification in insertion-only streams for graphs and hypergraphs, shaving prior $\log n$ factors and extending to min-cut, adversarially robust, and sliding-window settings—useful breadth for streaming theory and practice.
The problem is important, but the results feel incremental—they mainly tighten log factors and integrate known techniques, with the novelty lying more in careful refinement and synthesis than in new methods. Experiments are light: few/small datasets, no hypergraph or min-cut evaluations, no sliding-window/adversarial tests, limited comparisons, and few metrics ( no throughput or memory-per-update reporting).
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Graph Theory and Algorithms · Advanced Graph Neural Networks
