Breaking the Bandwidth Barrier: Geometrical Adaptive Entropy Estimation
Weihao Gao, Sewoong Oh, Pramod Viswanath

TL;DR
This paper introduces a novel entropy and mutual information estimator combining geometric and kernel methods with local bandwidth selection, achieving improved accuracy and a universal bias correction.
Contribution
It proposes a new estimator that integrates geometric and kernel approaches with local bandwidths, and demonstrates its universal bias properties and theoretical foundations.
Findings
Outperforms existing state-of-the-art estimators
Bias can be pre-computed and corrected universally
Provides a unified framework for kernel and NN estimators
Abstract
Estimators of information theoretic measures such as entropy and mutual information are a basic workhorse for many downstream applications in modern data science. State of the art approaches have been either geometric (nearest neighbor (NN) based) or kernel based (with a globally chosen bandwidth). In this paper, we combine both these approaches to design new estimators of entropy and mutual information that outperform state of the art methods. Our estimator uses local bandwidth choices of -NN distances with a finite , independent of the sample size. Such a local and data dependent choice improves performance in practice, but the bandwidth is vanishing at a fast rate, leading to a non-vanishing bias. We show that the asymptotic bias of the proposed estimator is universal; it is independent of the underlying distribution. Hence, it can be pre-computed and subtracted from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
