Vendi Information Gain: An Alternative To Mutual Information For Science And Machine Learning

Quan Nguyen; Adji Bousso Dieng

arXiv:2505.09007·cs.IT·May 20, 2025

Vendi Information Gain: An Alternative To Mutual Information For Science And Machine Learning

Quan Nguyen, Adji Bousso Dieng

PDF

Open Access

TL;DR

This paper introduces Vendi Information Gain (VIG), a new similarity-based measure that overcomes mutual information's limitations, enabling more effective information quantification in high-dimensional and sample-based scenarios.

Contribution

VIG is a novel, similarity-aware alternative to mutual information that only requires samples, is asymmetric, and generalizes MI, expanding information theory applications.

Findings

01

VIG outperforms MI in high-dimensional data analysis.

02

VIG effectively models human response times and epidemic processes.

03

VIG provides a unified framework for active data acquisition.

Abstract

In his 1948 seminal paper A Mathematical Theory of Communication that birthed information theory, Claude Shannon introduced mutual information (MI), which he called "rate of transmission", as a way to quantify information gain (IG) and defined it as the difference between the marginal and conditional entropy of a random variable. While MI has become a standard tool in science and engineering, it has several shortcomings. First, MI is often intractable - it requires a density over samples with tractable Shannon entropy - and existing techniques for approximating it often fail, especially in high dimensions. Moreover, in settings where MI is tractable, its symmetry and insensitivity to sample similarity are undesirable. In this paper, we propose the Vendi Information Gain (VIG), a novel alternative to MI that leverages the Vendi scores, a flexible family of similarity-based diversity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Physics and Python Applications · Scientific Computing and Data Management