Sparse Approximation of a Kernel Mean

E. Cruz Cort\'es; C. Scott

arXiv:1503.00323·stat.ML·March 3, 2015·2 cites

Sparse Approximation of a Kernel Mean

E. Cruz Cort\'es, C. Scott

PDF

Open Access

TL;DR

This paper introduces a fast, linear-time method for creating sparse approximations of kernel means, improving scalability in machine learning tasks like distribution embedding, class estimation, and clustering.

Contribution

It presents a novel incoherence-based bound and a $k$-center problem solution for efficient sparse kernel mean approximation with automatic sparsity selection.

Findings

01

Achieves linear time construction of sparse kernel means

02

Demonstrates computational gains in distribution embedding, class estimation, and clustering

03

Provides an automatic scheme for selecting sparsity level

Abstract

Kernel means are frequently used to represent probability distributions in machine learning problems. In particular, the well known kernel density estimator and the kernel mean embedding both have the form of a kernel mean. Unfortunately, kernel means are faced with scalability issues. A single point evaluation of the kernel density estimator, for example, requires a computation time linear in the training sample size. To address this challenge, we present a method to efficiently construct a sparse approximation of a kernel mean. We do so by first establishing an incoherence-based bound on the approximation error, and then noticing that, for the case of radial kernels, the bound can be minimized by solving the $k$ -center problem. The outcome is a linear time construction of a sparse kernel mean, which also lends itself naturally to an automatic sparsity selection scheme. We show the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM