Algorithms and Hardness for Estimating Statistical Similarity
Arnab Bhattacharyya, Sutanu Gayen, Kuldeep S. Meel, Dimitrios Myrisiotis, A. Pavan, N. V. Vinodchandran

TL;DR
This paper investigates the computational complexity of estimating statistical similarity between probability distributions, providing an efficient approximation algorithm for product distributions and proving hardness results for more complex models.
Contribution
It introduces the first FPTAS for estimating statistical similarity between product distributions and proves NP-hardness for Bayes net distributions of in-degree 2.
Findings
FPTAS achieves efficient approximation for product distributions.
Estimating similarity for Bayes nets of in-degree 2 is NP-hard.
Computational difficulty varies significantly with distribution class.
Abstract
We introduce and study the computational problem of determining statistical similarity between probability distributions. For distributions and over a finite sample space, their statistical similarity is defined as . Despite its fundamental nature as a measure of similarity between distributions, capturing essential concepts such as Bayes error in prediction and hypothesis testing, this computational problem has not been previously explored. Recent work on computing statistical distance has established that, somewhat surprisingly, even for the simple class of product distributions, exactly computing statistical similarity is -hard. This motivates the question of designing approximation algorithms for statistical similarity. Our first contribution is a Fully Polynomial-Time deterministic Approximation Scheme…
Peer Reviews
Decision·Submitted to ICLR 2026
Estimation properties of distributions is a fundamental topic in machine learning and at least the "complement" problem of estimating the TV distance is a well studied topic.
A major weakness of the paper is that it feels slightly misleading for the following reasons: There is actually nothing 'statistical' about the paper as the title/intro suggests. The algorithm is not sampling from the distributions! Rather, this is a purely query complexity result where the result assumes knowledge of the underlying probability values. There is no notion of sample complexity which is what the title 'statistical similarity' would suggest. In fact, this problem does not seem to
The problems studied here are fundamental. The paper provides strong theoretical results with clean proofs, both for algorithms and for lower bounds. The introduction motivated well that these are important problems to study, particularly these special cawses (product distribution adn Bayes net) which I was skeptical of at first.
My main concern is that the proof techniques apper quite standard or follow from prior work. Could you comment on where the technical novelty is? In particular, why does the result of Bhattacharyya et al. (2023) not already essentially imply Theorem 7?
The paper defines an FPTAS for computing the statistical similarity between product distributions, and confirms the NP-hardness of obtaining a multiplicative approximation for distributions given by Bayes nets of in-degree two. The work fills an interesting theoretical gap, complementing similar results for the total variation distance. The description of the algorithm, and the proofs of the various results are clear and easy to follow.
There are some potential issues with the proof of Theorem 6 (existence of an FPTAS algorithm) that I have highlighted below. I would be willing to provide a more positive score if these can be resolved. Perhaps some comments on the practical aspects of the proposed algorithm relative to the FPTAS for the TV distance (or other estimation techniques for this) could further motivate the work.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Neural Networks and Applications · Advanced Clustering Algorithms Research
