Communication Complexity of Distributed High Dimensional Correlation Testing
K. R. Sahasranand, Himanshu Tyagi

TL;DR
This paper investigates the communication complexity of testing correlation in high-dimensional distributed data, establishing tight bounds and showing the efficiency of direct testing over estimation methods.
Contribution
It provides tight bounds on the communication needed for distributed correlation testing and demonstrates the suboptimality of estimate-and-test approaches.
Findings
Order $d/\tau^2$ bits are sufficient and necessary for correlation testing.
Estimate-and-test approach requires roughly $d^2/\tau^2$ bits, which is suboptimal.
Bounds are tight for the one-dimensional one-way communication case.
Abstract
Two parties observe independent copies of a -dimensional vector and a scalar. They seek to test if their data is correlated or not, namely they seek to test if the norm of the correlation vector between their observations exceeds or is it . To that end, they communicate interactively and declare the output of the test. We show that roughly order bits of communication are sufficient and necessary for resolving the distributed correlation testing problem above. Furthermore, we establish a lower bound of roughly bits for communication needed for distributed correlation estimation, rendering the estimate-and-test approach suboptimal in communication required for distributed correlation testing. For the one-dimensional case with one-way communication, our bounds are tight even in the constant and provide a precise dependence of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWireless Communication Security Techniques · Distributed Sensor Networks and Detection Algorithms · Privacy-Preserving Technologies in Data
