# Communication Complexity of Estimating Correlations

**Authors:** Uri Hadar, Jingbo Liu, Yury Polyanskiy, Ofer Shayevitz

arXiv: 1901.09100 · 2019-04-19

## TL;DR

This paper determines the optimal communication complexity for estimating correlation coefficients between distributed Gaussian or binary variables, revealing that a simple one-way protocol achieves the best possible mean squared error rate.

## Contribution

It establishes the exact asymptotic communication cost for correlation estimation and introduces a one-way protocol that attains this optimal performance.

## Key findings

- Optimal $rac{1}{k}$ mean squared error rate with $k$ bits exchanged.
- One-way protocol suffices to achieve optimal performance.
- Lower bounds hold even for small ranges of $ho$, indicating tight bounds.

## Abstract

We characterize the communication complexity of the following distributed estimation problem. Alice and Bob observe infinitely many iid copies of $\rho$-correlated unit-variance (Gaussian or $\pm1$ binary) random variables, with unknown $\rho\in[-1,1]$. By interactively exchanging $k$ bits, Bob wants to produce an estimate $\hat\rho$ of $\rho$. We show that the best possible performance (optimized over interaction protocol $\Pi$ and estimator $\hat \rho$) satisfies $\inf_{\Pi \hat\rho}\sup_\rho \mathbb{E} [|\rho-\hat\rho|^2] = \tfrac{1}{k} (\frac{1}{2 \ln 2} + o(1))$. Curiously, the number of samples in our achievability scheme is exponential in $k$; by contrast, a naive scheme exchanging $k$ samples achieves the same $\Omega(1/k)$ rate but with a suboptimal prefactor. Our protocol achieving optimal performance is one-way (non-interactive). We also prove the $\Omega(1/k)$ bound even when $\rho$ is restricted to any small open sub-interval of $[-1,1]$ (i.e. a local minimax lower bound). Our proof techniques rely on symmetric strong data-processing inequalities and various tensorization techniques from information-theoretic interactive common-randomness extraction. Our results also imply an $\Omega(n)$ lower bound on the information complexity of the Gap-Hamming problem, for which we show a direct information-theoretic proof.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.09100/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/1901.09100/full.md

---
Source: https://tomesphere.com/paper/1901.09100