Optimal Partitions for Nonparametric Multivariate Entropy Estimation
Z. Keskin

TL;DR
This paper introduces an optimal partitioning method using k-d trees for nonparametric multivariate entropy estimation, improving accuracy especially for correlated distributions by aligning partitions to the data's orientation.
Contribution
It reformulates the entropy estimator with equiprobable histograms, revealing an implicit orientation parameter and proposing an optimal rotation to enhance estimation accuracy.
Findings
More accurate entropy estimates for correlated Gaussian distributions
Improved performance over existing techniques across different sample sizes
Demonstrated reduction in bias through optimal partition orientation
Abstract
Efficient and accurate estimation of multivariate empirical probability distributions is fundamental to the calculation of information-theoretic measures such as mutual information and transfer entropy. Common techniques include variations on histogram estimation which, whilst computationally efficient, are often unable to precisely capture the probability density of samples with high correlation, kurtosis or fine substructure, especially when sample sizes are small. Adaptive partitions, which adjust heuristically to the sample, can reduce the bias imparted from the geometry of the histogram itself, but these have commonly focused on the location, scale and granularity of the partition, the effects of which are limited for highly correlated distributions. In this paper, I reformulate the differential entropy estimator for the special case of an equiprobable histogram, using a k-d tree…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Neural Networks and Applications · Time Series Analysis and Forecasting
