How to Scale Up Kernel Methods to Be As Good As Deep Neural Nets

Zhiyun Lu; Avner May; Kuan Liu; Alireza Bagheri Garakani and; Dong Guo; Aur\'elien Bellet; Linxi Fan; Michael Collins; Brian; Kingsbury; Michael Picheny; Fei Sha

arXiv:1411.4000·cs.LG·January 25, 2019·46 cites

How to Scale Up Kernel Methods to Be As Good As Deep Neural Nets

Zhiyun Lu, Avner May, Kuan Liu, Alireza Bagheri Garakani and, Dong Guo, Aur\'elien Bellet, Linxi Fan, Michael Collins, Brian, Kingsbury, Michael Picheny, Fei Sha

PDF

Open Access

TL;DR

This paper develops scalable kernel methods that can handle large-scale problems and achieve performance comparable to deep neural networks, offering a convex optimization approach with fewer hyperparameters.

Contribution

It introduces efficient techniques to scale kernel models to large datasets, enabling direct comparison with deep learning architectures on large-scale tasks.

Findings

01

Kernel models match DNN performance on image and speech recognition.

02

Training cost for kernel models is comparable to DNNs, with fewer hyperparameters.

03

First direct large-scale comparison between kernel methods and deep neural networks.

Abstract

The computational complexity of kernel methods has often been a major barrier for applying them to large-scale learning problems. We argue that this barrier can be effectively overcome. In particular, we develop methods to scale up kernel models to successfully tackle large-scale learning problems that are so far only approachable by deep learning architectures. Based on the seminal work by Rahimi and Recht on approximating kernel functions with features derived from random projections, we advance the state-of-the-art by proposing methods that can efficiently train models with hundreds of millions of parameters, and learn optimal representations from multiple kernels. We conduct extensive empirical studies on problems from image recognition and automatic speech recognition, and show that the performance of our kernel models matches that of well-engineered deep neural nets (DNNs). To the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Machine Learning and Data Classification