The communication complexity of distributed estimation
Parikshit Gopalan, Raghu Meka, Prasad Raghavendra, Mihir Singhal, Avi Wigderson

TL;DR
This paper investigates the communication complexity involved in distributed estimation of expected function values between two parties, introducing new protocols and bounds that improve efficiency and establish optimality for various function classes.
Contribution
It presents a novel debiasing protocol reducing communication dependence on error parameter, and provides tight bounds and lower bound techniques for specific functions like equality and greater-than.
Findings
The debiasing protocol achieves linear dependence on 1/ε, improving over quadratic.
Upper bounds are improved for equality and greater-than functions.
Many protocols are proven to be optimal through spectral and discrepancy-based lower bounds.
Abstract
We study an extension of the standard two-party communication model in which Alice and Bob hold probability distributions and over domains and , respectively. Their goal is to estimate \[ \mathbb{E}_{x \sim p,\, y \sim q}[f(x, y)] \] to within additive error for a bounded function , known to both parties. We refer to this as the distributed estimation problem. Special cases of this problem arise in a variety of areas including sketching, databases and learning. Our goal is to understand how the required communication scales with the communication complexity of and the error parameter . The random sampling approach -- estimating the mean by averaging over random samples -- requires total communication, where is the randomized communication complexity of . We design a new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Machine Learning and Algorithms · Cryptography and Data Security
