Estimating the Shannon Entropy Using the Pitman--Yor Process
Takato Hashino, Koji Tsukuda

TL;DR
This paper introduces a Bayesian nonparametric method, based on the Pitman--Yor process, for estimating the Shannon entropy when the true number of species is unknown or only partly observed in the sample.
Contribution
It proposes an entropy estimator that uses the Pitman--Yor process to relax the assumption of a known species count. The estimator is proven consistent and is evaluated in numerical experiments.
Findings
The estimator is consistent for regularly varying distributions.
Numerical experiments show improved robustness over traditional methods.
The approach gives stable entropy estimates even when the number of observed species is smaller than the true number of species.
Abstract
The Shannon entropy is a fundamental measure for quantifying diversity and model complexity in fields such as information theory, ecology, and genetics. However, many existing studies assume that the number of species is known, an assumption that is often unrealistic in practice. In recent years, efforts have been made to relax this restriction. Motivated by these developments, this study proposes an entropy estimation method based on the Pitman--Yor process, a representative approach in Bayesian nonparametrics. By approximating the true distribution as an infinite-dimensional process, the proposed method enables stable estimation even when the number of observed species is smaller than the true number of species. This approach provides a principled way to deal with the uncertainty in species diversity and enhances the reliability and robustness of entropy-based diversity assessment. In…
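To make the setting concrete, the following is a minimal sketch, not the paper's estimator. It draws species counts from a Pitman--Yor process using the standard two-parameter Chinese restaurant construction, then computes the naive plug-in entropy from the observed counts. The function names (`sample_pitman_yor_crp`, `plugin_entropy`) and the parameter values are illustrative assumptions. Because the plug-in estimate only sees species that appear in the sample, it typically keeps growing as more species are discovered, which is the kind of difficulty the abstract describes.

```python
import math
import random


def sample_pitman_yor_crp(n, discount, concentration, rng):
    """Return species counts for n draws from a Pitman--Yor process.

    Uses the two-parameter Chinese restaurant process: with i earlier draws
    and k species seen so far, a new species appears with probability
    (concentration + discount * k) / (i + concentration), and existing
    species j is repeated with probability
    (counts[j] - discount) / (i + concentration).
    Parameter names are illustrative, not taken from the paper.
    """
    if not (0 <= discount < 1 and concentration > -discount):
        raise ValueError("need 0 <= discount < 1 and concentration > -discount")
    counts = []
    for i in range(n):
        if not counts:
            counts.append(1)  # the first draw always creates a new species
            continue
        new_weight = concentration + discount * len(counts)
        u = rng.random() * (i + concentration)
        if u < new_weight:
            counts.append(1)
            continue
        u -= new_weight
        for j, c in enumerate(counts):
            u -= c - discount
            if u < 0:
                counts[j] += 1
                break
        else:
            counts[-1] += 1  # guard against floating-point round-off
    return counts


def plugin_entropy(counts):
    """Naive plug-in Shannon entropy (in nats) from observed species counts."""
    n = sum(counts)
    return -sum((c / n) * math.log(c / n) for c in counts if c > 0)


if __name__ == "__main__":
    rng = random.Random(0)
    for n in (100, 1000, 10000):
        counts = sample_pitman_yor_crp(n, discount=0.5, concentration=1.0, rng=rng)
        print(n, len(counts), round(plugin_entropy(counts), 3))
```

Running the script prints, for each sample size, the number of observed species and the plug-in entropy. With a positive discount the species count keeps increasing with the sample size, so any estimator that treats the observed species as the complete list is working from an incomplete picture.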
Taxonomy
Topics: Census and Population Estimation · Bayesian Methods and Mixture Models · Species Distribution and Climate Change
