Testing Product Distributions: A Closer Look
Arnab Bhattacharyya, Sutanu Gayen, Saravanan Kandasamy, N. V., Vinodchandran

TL;DR
This paper explores the sample complexity of identity and closeness testing for high-dimensional product distributions, extending prior binary alphabet results to tolerant testing over arbitrary alphabets and measures.
Contribution
It provides a detailed analysis of how sample complexity varies with different distance measures and extends bounds to bounded-degree Bayes nets.
Findings
Sample complexity varies with distance measures in tolerant testing.
Extended bounds to bounded-degree Bayes nets.
Provided a comprehensive map of testing complexities for product distributions.
Abstract
We study the problems of identity and closeness testing of -dimensional product distributions. Prior works by Canonne, Diakonikolas, Kane and Stewart (COLT 2017) and Daskalakis and Pan (COLT 2017) have established tight sample complexity bounds for non-tolerant testing over a binary alphabet: given two product distributions and over a binary alphabet, distinguish between the cases and . We build on this prior work to give a more comprehensive map of the complexity of testing of product distributions by investigating tolerant testing with respect to several natural distance measures and over an arbitrary alphabet. Our study gives a fine-grained understanding of how the sample complexity of tolerant testing varies with the distance measures for product distributions. In addition, we also extend one of our upper bounds on product…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Machine Learning and Algorithms · Cryptography and Data Security
