Nonparametric Estimation of Joint Entropy via Partitioned Sample-Spacing
Jungwoo Ho, Sangun Park, Soyeong Oh

TL;DR
This paper introduces a nonparametric joint entropy estimator using partitioned sample spacing that outperforms traditional methods in accuracy, scalability, and robustness without requiring training or density modeling.
Contribution
The paper presents a novel PSS-based estimator for multivariate joint entropy that extends univariate spacing ideas to higher dimensions with strong consistency guarantees.
Findings
PSS outperforms $k$-nearest neighbor estimators in benchmarks.
PSS achieves accuracy comparable to normalizing flow methods.
PSS scales well in dimensions 10 to 40 and is robust to distribution skewness.
Abstract
We propose a nonparametric estimator of multivariate joint entropy based on partitioned sample spacing (PSS). The method extends univariate spacing ideas to by partitioning into localized cells and aggregating within-cell statistics, with strong consistency guarantees under mild conditions. In benchmarks across diverse distributions, PSS consistently outperforms -nearest neighbor estimators and achieves accuracy competitive with recent normalizing flow-based methods, while requiring no training or auxiliary density modeling. The estimator scales favorably in moderately high dimensions (--) and shows particular robustness to correlated or skewed distributions. These properties position PSS as a practical and reliable alternative to both NN and NF-based entropy estimators, with broad utility in information-theoretic machine learning tasks such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSingle-cell and spatial transcriptomics · Generative Adversarial Networks and Image Synthesis · Bayesian Methods and Mixture Models
