TL;DR
This paper introduces a simple Quantile Spacing method for accurately estimating one-dimensional entropy from equiprobable samples, outperforming the traditional Bin-Counting approach by avoiding hyperparameter tuning and providing reliable uncertainty estimates.
Contribution
The paper presents the QS method for entropy estimation, which uses quantile-based intervals and is less sensitive to distributional assumptions than BC, with fixed optimal quantile count relative to sample size.
Findings
QS method achieves less than 1% bias across tested distributions.
Optimal number of quantiles is approximately 25-35% of sample size.
Bootstrapping accurately captures the entropy estimate uncertainty.
Abstract
We develop a simple Quantile Spacing (QS) method for accurate probabilistic estimation of one-dimensional entropy from equiprobable random samples, and compare it with the popular Bin-Counting (BC) method. In contrast to BC, which uses equal-width bins with varying probability mass, the QS method uses estimates of the quantiles that divide the support of the data generating probability density function (pdf) into equal-probability-mass intervals. Whereas BC requires optimal tuning of a bin-width hyper-parameter whose value varies with sample size and shape of the pdf, QS requires specification of the number of quantiles to be used. Results indicate, for the class of distributions tested, that the optimal number of quantile-spacings is a fixed fraction of the sample size (empirically determined to be ~0.25-0.35), and that this value is relatively insensitive to distributional form or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
