Sliced Wasserstein Kernels for Probability Distributions

Soheil Kolouri; Yang Zou; and Gustavo K. Rohde

arXiv:1511.03198·cs.LG·November 11, 2015

Sliced Wasserstein Kernels for Probability Distributions

Soheil Kolouri, Yang Zou, and Gustavo K. Rohde

PDF

TL;DR

This paper introduces a new family of positive definite kernels based on the Sliced Wasserstein distance, enhancing kernel methods for probability distributions in machine learning.

Contribution

It provides the first provably positive definite kernels derived from the Sliced Wasserstein distance for use in kernel methods.

Findings

01

Improved performance in learning tasks using Sliced Wasserstein kernels

02

Theoretical guarantees of positive definiteness

03

Enhanced computational efficiency for optimal transport distances

Abstract

Optimal transport distances, otherwise known as Wasserstein distances, have recently drawn ample attention in computer vision and machine learning as a powerful discrepancy measure for probability distributions. The recent developments on alternative formulations of the optimal transport have allowed for faster solutions to the problem and has revamped its practical applications in machine learning. In this paper, we exploit the widely used kernel methods and provide a family of provably positive definite kernels based on the Sliced Wasserstein distance and demonstrate the benefits of these kernels in a variety of learning tasks. Our work provides a new perspective on the application of optimal transport flavored distances through kernel methods in machine learning tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.