A New Estimator of Kullback--Leibler Divergence via Shannon Entropy
Mehmet Siddik Cadirci, Martin Singull

TL;DR
This paper introduces a novel estimator for Kullback-Leibler divergence based on Shannon entropy and k-nearest neighbor methods, providing an effective goodness-of-fit test for multivariate normality with superior performance in high dimensions.
Contribution
The paper proposes a new entropy-based estimator for KL divergence using kNN methods and develops a Gaussian goodness-of-fit test with proven consistency and improved power.
Findings
Estimator shows consistent convergence properties.
Test accurately controls Type I error.
Demonstrates superior power in high-dimensional settings.
Abstract
We examine the estimation of the Kullback-Leibler (KL) divergence and the use of the goodness-of-fit test for multivariate continuous distributions. Our starting point is the maximum entropy principle for Shannon entropy: among all distributions with a fixed mean vector and covariance matrix, the multivariate Gaussian distributions uniquely maximize entropy. As a result, the KL divergence from a moment-matched Gaussian distribution to an unknown density can then be written as the \emph{entropy difference}, which is a suitable information-theoretic measure of divergence from the Gaussian distribution. To estimate, we use -nearest neighbor (kNN) estimators based on Shannon entropy and KL divergence derived from the Kozachenko-Leonenko approach and subsequent improvements, along with the consistency and -convergence results established for these estimators. Motivated by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Advanced Statistical Methods and Models · Statistical Distribution Estimation and Applications
