Statistical and Geometrical properties of regularized Kernel   Kullback-Leibler divergence

Cl\'ementine Chazal; Anna Korba; Francis Bach

arXiv:2408.16543·stat.ML·March 12, 2025

Statistical and Geometrical properties of regularized Kernel Kullback-Leibler divergence

Cl\'ementine Chazal, Anna Korba, Francis Bach

PDF

1 Repo 1 Video

TL;DR

This paper introduces a regularized version of the kernel Kullback-Leibler divergence (KKL) that is well-defined for all distributions, provides theoretical bounds, and develops practical algorithms for discrete distributions.

Contribution

It proposes a regularized KKL divergence, derives bounds and a closed-form expression, and develops a Wasserstein gradient descent scheme for discrete distributions.

Findings

01

Regularized KKL is well-defined for all distributions.

02

Finite-sample bounds for the divergence are established.

03

A practical gradient descent algorithm for discrete distributions is demonstrated.

Abstract

In this paper, we study the statistical and geometrical properties of the Kullback-Leibler divergence with kernel covariance operators (KKL) introduced by Bach [2022]. Unlike the classical Kullback-Leibler (KL) divergence that involves density ratios, the KKL compares probability distributions through covariance operators (embeddings) in a reproducible kernel Hilbert space (RKHS), and compute the Kullback-Leibler quantum divergence. This novel divergence hence shares parallel but different aspects with both the standard Kullback-Leibler between probability distributions and kernel embeddings metrics such as the maximum mean discrepancy. A limitation faced with the original KKL divergence is its inability to be defined for distributions with disjoint supports. To solve this problem, we propose in this paper a regularised variant that guarantees that the divergence is well defined for all…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cclementine25/kkl-divergence-gradient-flows
pytorchOfficial

Videos

Statistical and Geometrical properties of the Kernel Kullback-Leibler divergence· slideslive

Taxonomy

MethodsSparse Evolutionary Training