Empirical estimation of entropy functionals with confidence

Kumar Sricharan; Raviv Raich; Alfred O. Hero III

arXiv:1012.4188·math.ST·February 28, 2012·26 cites

Empirical estimation of entropy functionals with confidence

Kumar Sricharan, Raviv Raich, Alfred O. Hero III

PDF

Open Access

TL;DR

This paper proposes a new bipartite plug-in ($k$-NN) estimator for entropy functionals that improves accuracy and confidence interval estimation by using data-splitting and boundary correction techniques.

Contribution

It introduces a novel $k$-NN based estimator with explicit bias-variance analysis, optimal parameter tuning, and asymptotic confidence intervals for entropy estimation.

Findings

01

Achieves faster convergence and lower MSE than previous estimators.

02

Provides explicit bias and variance rates for the estimator.

03

Establishes a central limit theorem for confidence interval construction.

Abstract

This paper introduces a class of k-nearest neighbor ( $k$ -NN) estimators called bipartite plug-in (BPI) estimators for estimating integrals of non-linear functions of a probability density, such as Shannon entropy and R\'enyi entropy. The density is assumed to be smooth, have bounded support, and be uniformly bounded from below on this set. Unlike previous $k$ -NN estimators of non-linear density functionals, the proposed estimator uses data-splitting and boundary correction to achieve lower mean square error. Specifically, we assume that $T$ i.i.d. samples $X_{i} \in R^{d}$ from the density are split into two pieces of cardinality $M$ and $N$ respectively, with $M$ samples used for computing a k-nearest-neighbor density estimate and the remaining $N$ samples used for empirical estimation of the integral of the density functional. By studying the statistical properties of k-NN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Bayesian Modeling and Causal Inference · Statistical Methods and Inference