L2 Regularization for Learning Kernels

Corinna Cortes; Mehryar Mohri; Afshin Rostamizadeh

arXiv:1205.2653·cs.LG·May 14, 2012·363 cites

L2 Regularization for Learning Kernels

Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

PDF

Open Access

TL;DR

This paper investigates L2 regularization for learning kernels in regression, providing theoretical analysis, an efficient algorithm, and experimental results showing its advantages over L1 regularization, especially with many kernels.

Contribution

It introduces an L2 regularization framework for kernel learning, derives the solution form, and offers theoretical and empirical evidence of its effectiveness.

Findings

01

L2 regularization improves performance with many kernels

02

Theoretical stability bounds are established for orthogonal kernels

03

L1 regularization can degrade performance in large-scale cases

Abstract

The choice of the kernel is critical to the success of many learning algorithms but it is typically left to the user. Instead, the training data can be used to learn the kernel by selecting it out of a given family, such as that of non-negative linear combinations of p base kernels, constrained by a trace or L1 regularization. This paper studies the problem of learning kernels with the same family of kernels but with an L2 regularization instead, and for regression problems. We analyze the problem of learning kernels with ridge regression. We derive the form of the solution of the optimization problem and give an efficient iterative algorithm for computing that solution. We present a novel theoretical analysis of the problem based on stability and give learning bounds for orthogonal kernels that contain only an additive term O(pp/m) when compared to the standard kernel ridge regression…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Machine Learning and ELM · Domain Adaptation and Few-Shot Learning