Asymptotic Normality for Plug-in Estimators of Generalized Shannon's Entropy
Jialin Zhang, Jingyi Shi

TL;DR
This paper establishes the asymptotic normality of the plug-in estimator for generalized Shannon's entropy on countable alphabets, enabling statistical inference without distribution assumptions.
Contribution
It provides the first asymptotic normality results for the plug-in estimator of generalized Shannon's entropy, broadening its theoretical foundation and practical applicability.
Findings
Asymptotic normality holds without distribution assumptions.
Enables interval estimation and hypothesis testing for generalized Shannon's entropy.
Extends theoretical understanding of entropy estimation in information theory.
Abstract
Shannon's entropy is one of the building blocks of information theory and an essential aspect of Machine Learning methods (e.g., Random Forests). Yet, it is only finitely defined for distributions with fast decaying tails on a countable alphabet. The unboundedness of Shannon's entropy over the general class of all distributions on an alphabet prevents its potential utility from being fully realized. To fill the void in the foundation of information theory, Zhang (2020) proposed generalized Shannon's entropy, which is finitely defined everywhere. The plug-in estimator, adopted in almost all entropy-based ML method packages, is one of the most popular approaches to estimating Shannon's entropy. The asymptotic distribution for Shannon's entropy's plug-in estimator was well studied in the existing literature. This paper studies the asymptotic properties for the plug-in estimator of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
