The Vendi Score: A Diversity Evaluation Metric for Machine Learning
Dan Friedman, Adji Bousso Dieng

TL;DR
The paper introduces the Vendi Score, a flexible, reference-free diversity metric for machine learning that leverages similarity matrices and entropy to evaluate diversity across various domains.
Contribution
It proposes the Vendi Score, a novel diversity metric based on entropy of eigenvalues, applicable without reference datasets and adaptable to any similarity measure.
Findings
Addresses limitations of existing diversity metrics in ML.
Effectively measures diversity in molecular, image, and text generative models.
Reveals mode collapse in GANs even when all data modes are captured.
Abstract
Diversity is an important criterion for many areas of machine learning (ML), including generative modeling and dataset curation. However, existing metrics for measuring diversity are often domain-specific and limited in flexibility. In this paper, we address the diversity evaluation problem by proposing the Vendi Score, which connects and extends ideas from ecology and quantum statistical mechanics to ML. The Vendi Score is defined as the exponential of the Shannon entropy of the eigenvalues of a similarity matrix. This matrix is induced by a user-defined similarity function applied to the sample to be evaluated for diversity. In taking a similarity function as input, the Vendi Score enables its user to specify any desired form of diversity. Importantly, unlike many existing metrics in ML, the Vendi Score does not require a reference dataset or distribution over samples or labels, it is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Cell Image Analysis Techniques · Machine Learning in Materials Science
