Frequency Moments in Noisy Streaming and Distributed Data under Mismatch Ambiguity
Kaiwen Liu, Qin Zhang

TL;DR
This paper introduces a new framework for estimating frequency moments in noisy data streams and distributed systems, revealing increased complexity due to noise and identifying conditions for efficient approximation.
Contribution
It presents a novel data-dependent measure called $F_p$-mismatch-ambiguity and establishes tight lower bounds, advancing understanding of noisy data estimation.
Findings
Estimating $F_p$ in noisy streams requires polynomial space, unlike in noiseless cases.
Achieving polylogarithmic communication for $F_p$ under noise is generally impossible, but becomes feasible below a certain ambiguity threshold.
Abstract
We propose a novel framework for statistical estimation on noisy datasets. Within this framework, we focus on the frequency moments () problem and demonstrate that it is possible to approximate of the unknown ground-truth dataset using sublinear space in the data stream model and sublinear communication in the coordinator model, provided that the approximation ratio is parameterized by a data-dependent quantity, which we call the -mismatch-ambiguity. We also establish a set of lower bounds, which are tight in terms of the input size. Our results yield several interesting insights: (1) In the data stream model, the problem is inherently more difficult in the noisy setting than in the noiseless one. In particular, while can be approximated in logarithmic space in terms of the input size in the noiseless setting, any algorithm for in the noisy setting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Sensor Networks and Detection Algorithms · Privacy-Preserving Technologies in Data · Data Stream Mining Techniques
