Guaranteed Deterministic Bounds on the Total Variation Distance between Univariate Mixtures
Frank Nielsen, Ke Sun

TL;DR
This paper introduces two methods to compute guaranteed deterministic bounds on the total variation distance for univariate mixture models, addressing the lack of closed-form solutions and improving efficiency and reliability.
Contribution
The paper presents novel guaranteed bounds for total variation distance in univariate mixtures, utilizing information monotonicity and geometric envelopes, which are tighter and more efficient than existing methods.
Findings
The bounds are tight for Gaussian, Gamma, and Rayleigh mixtures.
The methods outperform Monte Carlo in providing deterministic guarantees.
Experimental results demonstrate the effectiveness of the bounds.
Abstract
The total variation distance is a core statistical distance between probability measures that satisfies the metric axioms, with value always falling in . This distance plays a fundamental role in machine learning and signal processing: It is a member of the broader class of -divergences, and it is related to the probability of error in Bayesian hypothesis testing. Since the total variation distance does not admit closed-form expressions for statistical mixtures (like Gaussian mixture models), one often has to rely in practice on costly numerical integrations or on fast Monte Carlo approximations that however do not guarantee deterministic lower and upper bounds. In this work, we consider two methods for bounding the total variation of univariate mixture models: The first method is based on the information monotonicity property of the total variation to design guaranteed nested…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Machine Learning and Algorithms
