On a Variational Definition for the Jensen-Shannon Symmetrization of Distances based on the Information Radius
Frank Nielsen

TL;DR
This paper introduces a generalized variational framework for Jensen-Shannon divergence and symmetrization of distances, extending existing concepts and enabling applications in clustering and quantization of probability measures.
Contribution
It proposes a novel variational definition for Jensen-Shannon symmetrization applicable to arbitrary distances, generalizing Sibson's information radius and related divergence measures.
Findings
Generalized Jensen-Shannon divergences for arbitrary distances
Application to clustering and quantization of probability measures
Extension to relative Jensen-Shannon divergences and information projections
Abstract
We generalize the Jensen-Shannon divergence by considering a variational definition with respect to a generic mean extending thereby the notion of Sibson's information radius. The variational definition applies to any arbitrary distance and yields another way to define a Jensen-Shannon symmetrization of distances. When the variational optimization is further constrained to belong to prescribed probability measure families, we get relative Jensen-Shannon divergences and symmetrizations which generalize the concept of information projections. Finally, we discuss applications of these variational Jensen-Shannon divergences and diversity indices to clustering and quantization tasks of probability measures including statistical mixtures.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
