Preconditioning Kernel Matrices

Kurt Cutajar; Michael A. Osborne; John P. Cunningham; Maurizio; Filippone

arXiv:1602.06693·stat.ML·May 26, 2016·ICML·22 cites

Preconditioning Kernel Matrices

Kurt Cutajar, Michael A. Osborne, John P. Cunningham, Maurizio, Filippone

PDF

Open Access 1 Repo

TL;DR

This paper introduces preconditioned conjugate gradient methods tailored for kernel machines, enhancing scalability and convergence by developing effective preconditioners and scalable hyperparameter learning techniques.

Contribution

It proposes a novel preconditioning approach for kernel matrices, improving the efficiency and scalability of kernel machine training and hyperparameter optimization.

Findings

01

Outperforms state-of-the-art approximations within the same computational budget.

02

Provides a scalable method for solving kernel machines and learning hyperparameters.

03

Demonstrates exactness in the limit of iterations.

Abstract

The computational and storage complexity of kernel machines presents the primary barrier to their scaling to large, modern, datasets. A common way to tackle the scalability issue is to use the conjugate gradient algorithm, which relieves the constraints on both storage (the kernel matrix need not be stored) and computation (both stochastic gradients and parallelization can be used). Even so, conjugate gradient is not without its own issues: the conditioning of kernel matrices is often such that conjugate gradients will have poor convergence in practice. Preconditioning is a common approach to alleviating this issue. Here we propose preconditioned conjugate gradients for kernel machines, and develop a broad range of preconditioners particularly useful for kernel matrices. We describe a scalable approach to both solving kernel machines and learning their hyperparameters. We show this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mauriziofilippone/preconditioned_GPs
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference · Machine Learning and ELM