Fast Robust Kernel Regression through Sign Gradient Descent with Early Stopping
Oskar Allerbo

TL;DR
This paper introduces a fast, robust kernel regression method using sign gradient descent with early stopping, replacing traditional penalties to achieve robustness and sparsity, and demonstrates significant speed improvements over existing methods.
Contribution
The paper presents a novel formulation connecting regularization penalties with gradient descent methods, enabling efficient robust and sparse kernel regression.
Findings
Sign gradient descent achieves 1-2 orders of magnitude faster computation.
The method maintains comparable accuracy to existing robust kernel regression techniques.
Theoretical bounds relate early stopping to regularization effects.
Abstract
Kernel ridge regression, KRR, is a generalization of linear ridge regression that is non-linear in the data, but linear in the model parameters. Here, we introduce an equivalent formulation of the objective function of KRR, which opens up for replacing the ridge penalty with the and penalties. Using the and penalties, we obtain robust and sparse kernel regression, respectively. We study the similarities between explicitly regularized kernel regression and the solutions obtained by early stopping of iterative gradient-based methods, where we connect regularization to sign gradient descent, regularization to forward stagewise regression (also known as coordinate descent), and regularization to gradient descent, and, in the last case, theoretically bound for the differences. We exploit the close relations between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Machine Learning and ELM · Bone and Joint Diseases
