Gaussian processes with multidimensional distribution inputs via optimal transport and Hilbertian embedding
Francois Bachoc, Alexandra Suvorikova, David Ginsbourger, Jean-Michel, Loubes, Vladimir Spokoiny

TL;DR
This paper develops a novel approach for Gaussian processes with multidimensional distribution inputs using optimal transport and Hilbert space embeddings, addressing limitations of Wasserstein-based kernels in higher dimensions.
Contribution
It introduces a new positive definite kernel construction for multivariate distributions via Hilbert space embeddings and analyzes their statistical and practical properties.
Findings
Kernel constructions are valid in high-dimensional spaces.
The proposed kernels are consistent with empirical barycenters.
Application to regression shows promising results in simulations.
Abstract
In this work, we investigate Gaussian Processes indexed by multidimensional distributions. While directly constructing radial positive definite kernels based on the Wasserstein distance has been proven to be possible in the unidimensional case, such constructions do not extend well to the multidimensional case as we illustrate here. To tackle the problem of defining positive definite kernels between multivariate distributions based on optimal transport, we appeal instead to Hilbert space embeddings relying on optimal transport maps to a reference distribution, that we suggest to take as a Wasserstein barycenter. We characterize in turn radial positive definite kernels on Hilbert spaces, and show that the covariance parameters of virtually all parametric families of covariance functions are microergodic in the case of (infinite-dimensional) Hilbert spaces. We also investigate statistical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
