Learning primal-dual sparse kernel machines
Riikka Huusari, Sahely Bhadra, C\'ecile Capponi, Hachem Kadri, Juho, Rousu

TL;DR
This paper introduces a primal-dual sparse kernel machine approach that improves scalability and interpretability over traditional kernel methods by optimizing over sparse elements in the input space, supported by theoretical generalization guarantees.
Contribution
It proposes a novel primal-dual sparse kernel learning method that avoids the representer theorem, enabling scalable and interpretable models with theoretical generalization bounds.
Findings
Achieves comparable accuracy to traditional kernel methods.
Demonstrates improved scalability and interpretability.
Provides theoretical generalization guarantees.
Abstract
Traditionally, kernel methods rely on the representer theorem which states that the solution to a learning problem is obtained as a linear combination of the data mapped into the reproducing kernel Hilbert space (RKHS). While elegant from theoretical point of view, the theorem is prohibitive for algorithms' scalability to large datasets, and the interpretability of the learned function. In this paper, instead of using the traditional representer theorem, we propose to search for a solution in RKHS that has a pre-image decomposition in the original data space, where the elements don't necessarily correspond to the elements in the training set. Our gradient-based optimisation method then hinges on optimising over possibly sparse elements in the input space, and enables us to obtain a kernel-based model with both primal and dual sparsity. We give theoretical justification on the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
