Sparse Multiple Kernel Learning with Geometric Convergence Rate
Rong Jin, Tianbao Yang, Mehrdad Mahdavi

TL;DR
This paper introduces an efficient greedy coordinate descent algorithm for sparse multiple kernel learning that achieves geometric convergence rates and provides generalization error bounds based on local Rademacher complexity.
Contribution
The paper presents a novel algorithm for sparse MKL with geometric convergence guarantees and a new data-dependent gradient measurement approach.
Findings
Achieves geometric convergence rate under certain conditions.
Provides a generalization error bound using local Rademacher complexity.
Demonstrates improved efficiency over previous MKL algorithms.
Abstract
In this paper, we study the problem of sparse multiple kernel learning (MKL), where the goal is to efficiently learn a combination of a fixed small number of kernels from a large pool that could lead to a kernel classifier with a small prediction error. We develop an efficient algorithm based on the greedy coordinate descent algorithm, that is able to achieve a geometric convergence rate under appropriate conditions. The convergence rate is achieved by measuring the size of functional gradients by an empirical norm that depends on the empirical data distribution. This is in contrast to previous algorithms that use a functional norm to measure the size of gradients, which is independent from the data samples. We also establish a generalization error bound of the learned sparse kernel classifier using the technique of local Rademacher complexity.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Machine Learning and ELM · Face and Expression Recognition
