Annotations for Sparse Data Streams
Amit Chakrabarti, Graham Cormode, Navin Goyal, Justin Thaler

TL;DR
This paper introduces novel annotation schemes for sparse data streams that achieve sublinear space and annotation complexity relative to the total number of updates, enabling efficient verification of large-scale, sparse datasets.
Contribution
It presents the first schemes that operate sublinearly in the total number of stream updates for sparse data, addressing problems like INDEX, SET-DISJOINTNESS, and graph problems.
Findings
Achieved sublinear space and annotation complexity for sparse data streams.
Established a lower bound ruling out certain tradeoffs for a specific problem.
Provided new insights into Merlin-Arthur communication complexity models.
Abstract
Motivated by cloud computing, a number of recent works have studied annotated data streams and variants thereof. In this setting, a computationally weak verifier (cloud user), lacking the resources to store and manipulate his massive input locally, accesses a powerful but untrusted prover (cloud service). The verifier must work within the restrictive data streaming paradigm. The prover, who can annotate the data stream as it is read, must not just supply the answer but also convince the verifier of its correctness. Ideally, both the amount of annotation and the space used by the verifier should be sublinear in the relevant input size parameters. A rich theory of such algorithms -- which we call schemes -- has emerged. Prior work has shown how to leverage the prover's power to efficiently solve problems that have no non-trivial standard data stream algorithms. However, while optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
