A Global Characterization of $f$-Divergences Yielding PSD Mutual-Information Matrices

Zachary Robertson

arXiv:2601.08929·cs.IT·May 15, 2026

A Global Characterization of $f$-Divergences Yielding PSD Mutual-Information Matrices

Zachary Robertson

PDF

TL;DR

This paper characterizes when pairwise $f$-mutual information matrices form positive semi-definite kernels, revealing the divergence properties that determine their PSD nature across finite-alphabet variables.

Contribution

It provides a complete characterization of convex $f$-divergences that produce PSD mutual-information matrices, including necessary and sufficient conditions.

Findings

01

The normalized generator must have a nonnegative power series expansion around 1.

02

Shannon mutual information and Jensen-Shannon divergence do not produce PSD matrices.

03

Chi-squared divergence always yields a PSD mutual-information matrix.

Abstract

Given $n$ random variables, when does the matrix of pairwise $f$ -mutual informations define a PSD kernel over variables? For convex finite generators $f : (0, \infty) \to R$ with $f (1) = 0$ and finite boundary value $f (0)$ , we give a closed characterization up to linear transformation $f \sim f + c (t - 1)$ , which leaves every $f$ -divergence and every $f$ -mutual-information matrix unchanged. The matrix $M_{ij}^{(f)} := I_{f} (X_{i}; X_{j})$ is PSD for every finite-alphabet family if and only if the normalized representative has a globally convergent expansion $\overset{ˉ}{f} (t) = \sum_{m \geq 2} a_{m} (t - 1)^{m}$ , with $a_{m} \geq 0$ , on all of $(0, \infty)$ . Sufficiency follows from a replica embedding for monomial generators plus closure under nonnegative mixtures. Necessity first extracts the local Taylor cone at $1$ using biased three-point kernels $H_{a}$ , the Belton--Guillot--Khare--Putinar (BGKP) low-rank Hankel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.