TL;DR
This paper introduces a regularized version of the kernel Kullback-Leibler divergence (KKL) that is well-defined for all distributions, provides theoretical bounds, and develops practical algorithms for discrete distributions.
Contribution
It proposes a regularized KKL divergence, derives bounds and a closed-form expression, and develops a Wasserstein gradient descent scheme for discrete distributions.
Findings
Regularized KKL is well-defined for all distributions.
Finite-sample bounds for the divergence are established.
A practical gradient descent algorithm for discrete distributions is demonstrated.
Abstract
In this paper, we study the statistical and geometrical properties of the Kullback-Leibler divergence with kernel covariance operators (KKL) introduced by Bach [2022]. Unlike the classical Kullback-Leibler (KL) divergence that involves density ratios, the KKL compares probability distributions through covariance operators (embeddings) in a reproducible kernel Hilbert space (RKHS), and compute the Kullback-Leibler quantum divergence. This novel divergence hence shares parallel but different aspects with both the standard Kullback-Leibler between probability distributions and kernel embeddings metrics such as the maximum mean discrepancy. A limitation faced with the original KKL divergence is its inability to be defined for distributions with disjoint supports. To solve this problem, we propose in this paper a regularised variant that guarantees that the divergence is well defined for all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
MethodsSparse Evolutionary Training
