TL;DR
This paper improves the efficiency of a sublinear-space streaming algorithm for maximum coverage, reducing time costs and simplifying mechanisms while maintaining approximation quality and low space complexity.
Contribution
The authors optimize a sublinear-space streaming algorithm for maximum coverage by reducing time costs, refining error bounds, and simplifying the sketching mechanism without sacrificing performance.
Findings
Optimizations do not alter space complexity or approximation quality.
Refined error bounds allow for lower independence requirements.
Simplified F0-sketching maintains solution quality with faster execution.
Abstract
Given a collection of sets from a universe , the Maximum Set Coverage problem consists of finding sets whose union has largest cardinality. This problem is NP-Hard, but the solution can be approximated by a polynomial time algorithm up to a factor . However, this algorithm does not scale well with the input size. In a streaming context, practical high-quality solutions are found, but with space complexity that scales linearly with respect to the size of the universe . However, one randomized streaming algorithm has been shown to produce a approximation of the optimal solution with a space complexity that scales only poly-logarithmically with respect to and . In order to achieve such a low space complexity, the authors used a technique called subsampling, based on independent-wise hash functions, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
