Universal Streaming
Vladimir Braverman, Rafail Ostrovsky, Alan Roytman

TL;DR
This paper introduces a universal data collection method during streaming that enables the computation of various statistics after the stream, using small memory, with proven bounds for both standard and sliding window models.
Contribution
It presents the first universal streaming data collection technique with matching upper and lower bounds, enabling post-stream computation of diverse statistics with small space.
Findings
Universal statistics of polylogarithmic size are achievable.
Post-stream computation of any statistic with similar space complexity is possible.
Results apply to both unbounded and sliding window streaming models.
Abstract
Given a stream of data, a typical approach in streaming algorithms is to design a sophisticated algorithm with small memory that computes a specific statistic over the streaming data. Usually, if one wants to compute a different statistic after the stream is gone, it is impossible. But what if we want to compute a different statistic after the fact? In this paper, we consider the following fascinating possibility: can we collect some small amount of specific data during the stream that is "universal," i.e., where we do not know anything about the statistics we will want to later compute, other than the guarantee that had we known the statistic ahead of time, it would have been possible to do so with small memory? In other words, is it possible to collect some data in small space during the stream, such that any other statistic that can be computed with comparable space can be computed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Data Management and Algorithms · Algorithms and Data Compression
