Distance Estimation for High-Dimensional Discrete Distributions
Gunjan Kumar, Kuldeep S. Meel, Yash Pote

TL;DR
This paper introduces the first polynomial-query algorithm for estimating the statistical distance between high-dimensional distributions using subcube conditional sampling, a model relevant for practical discrete data analysis.
Contribution
It presents a novel polynomial-query distance estimator in the subcube conditional sampling model, extending the capabilities of high-dimensional distribution testing.
Findings
Algorithm makes (n^3/ ^5) queries
First polynomial-query estimator in model
Applicable to practical discrete sampling scenarios
Abstract
Given two distributions and over a high-dimensional domain , and a parameter , the goal of distance estimation is to determine the statistical distance between and , up to an additive tolerance . Since exponential lower bounds (in ) are known for the problem in the standard sampling model, research has focused on richer query models where one can draw conditional samples. This paper presents the first polynomial query distance estimator in the conditional sampling model (). We base our algorithm on the relatively weaker \textit{subcube conditional} sampling () oracle, which draws samples from the distribution conditioned on some of the dimensions. is a promising model for widespread practical use because it captures the natural behavior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning
