TL;DR
This paper introduces representational R\'enyi heterogeneity (RRH), a novel method for measuring heterogeneity in data by transforming observable data into a latent space, avoiding the need for predefined categories or distance metrics.
Contribution
The paper proposes RRH, a new heterogeneity measure that operates on learned latent representations, extending applicability to non-categorical and complex data.
Findings
RRH generalizes existing biodiversity and economic indices.
RRH responds more appropriately to changes in mixture components.
RRH effectively measures heterogeneity in natural images using deep neural network representations.
Abstract
A discrete system's heterogeneity is measured by the R\'enyi heterogeneity family of indices (also known as Hill numbers or Hannah--Kay indices), whose units are {the numbers equivalent}. Unfortunately, numbers equivalent heterogeneity measures for non-categorical data require {a priori} (A) categorical partitioning and (B) pairwise distance measurement on the observable data space, thereby precluding application to problems with ill-defined categories or where semantically relevant features must be learned as abstractions from some data. We thus introduce representational R\'enyi heterogeneity (RRH), which transforms an observable domain onto a latent space upon which the R\'enyi heterogeneity is both tractable and semantically relevant. This method requires neither {a priori} binning nor definition of a distance function on the observable space. We show that RRH can generalize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
