Linear Time Feature Selection for Regularized Least-Squares
Tapio Pahikkala, Antti Airola, Tapio Salakoski

TL;DR
This paper introduces a fast, linear-time greedy feature selection algorithm for regularized least-squares models, enabling scalable and efficient large-scale learning with improved feature subset quality.
Contribution
The paper presents a novel linear-time greedy feature selection algorithm for RLS models, significantly improving speed and scalability over previous methods.
Findings
Algorithm is faster than previous methods due to linear complexity.
Effectively finds high-quality feature subsets in large datasets.
Demonstrates scalability and practical utility in experiments.
Abstract
We propose a novel algorithm for greedy forward feature selection for regularized least-squares (RLS) regression and classification, also known as the least-squares support vector machine or ridge regression. The algorithm, which we call greedy RLS, starts from the empty feature set, and on each iteration adds the feature whose addition provides the best leave-one-out cross-validation performance. Our method is considerably faster than the previously proposed ones, since its time complexity is linear in the number of training examples, the number of features in the original data set, and the desired size of the set of selected features. Therefore, as a side effect we obtain a new training algorithm for learning sparse linear RLS predictors which can be used for large scale learning. This speed is possible due to matrix calculus based short-cuts for leave-one-out and feature addition. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Sparse and Compressive Sensing Techniques · Blind Source Separation Techniques
