How to Scale Up Kernel Methods to Be As Good As Deep Neural Nets
Zhiyun Lu, Avner May, Kuan Liu, Alireza Bagheri Garakani and, Dong Guo, Aur\'elien Bellet, Linxi Fan, Michael Collins, Brian, Kingsbury, Michael Picheny, Fei Sha

TL;DR
This paper develops scalable kernel methods that can handle large-scale problems and achieve performance comparable to deep neural networks, offering a convex optimization approach with fewer hyperparameters.
Contribution
It introduces efficient techniques to scale kernel models to large datasets, enabling direct comparison with deep learning architectures on large-scale tasks.
Findings
Kernel models match DNN performance on image and speech recognition.
Training cost for kernel models is comparable to DNNs, with fewer hyperparameters.
First direct large-scale comparison between kernel methods and deep neural networks.
Abstract
The computational complexity of kernel methods has often been a major barrier for applying them to large-scale learning problems. We argue that this barrier can be effectively overcome. In particular, we develop methods to scale up kernel models to successfully tackle large-scale learning problems that are so far only approachable by deep learning architectures. Based on the seminal work by Rahimi and Recht on approximating kernel functions with features derived from random projections, we advance the state-of-the-art by proposing methods that can efficiently train models with hundreds of millions of parameters, and learn optimal representations from multiple kernels. We conduct extensive empirical studies on problems from image recognition and automatic speech recognition, and show that the performance of our kernel models matches that of well-engineered deep neural nets (DNNs). To the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Machine Learning and Data Classification
