Empirical estimation of entropy functionals with confidence
Kumar Sricharan, Raviv Raich, Alfred O. Hero III

TL;DR
This paper proposes a new bipartite plug-in ($k$-NN) estimator for entropy functionals that improves accuracy and confidence interval estimation by using data-splitting and boundary correction techniques.
Contribution
It introduces a novel $k$-NN based estimator with explicit bias-variance analysis, optimal parameter tuning, and asymptotic confidence intervals for entropy estimation.
Findings
Achieves faster convergence and lower MSE than previous estimators.
Provides explicit bias and variance rates for the estimator.
Establishes a central limit theorem for confidence interval construction.
Abstract
This paper introduces a class of k-nearest neighbor (-NN) estimators called bipartite plug-in (BPI) estimators for estimating integrals of non-linear functions of a probability density, such as Shannon entropy and R\'enyi entropy. The density is assumed to be smooth, have bounded support, and be uniformly bounded from below on this set. Unlike previous -NN estimators of non-linear density functionals, the proposed estimator uses data-splitting and boundary correction to achieve lower mean square error. Specifically, we assume that i.i.d. samples from the density are split into two pieces of cardinality and respectively, with samples used for computing a k-nearest-neighbor density estimate and the remaining samples used for empirical estimation of the integral of the density functional. By studying the statistical properties of k-NN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Bayesian Modeling and Causal Inference · Statistical Methods and Inference
