TL;DR
This paper introduces SAGA, a surrogate-assisted genetic algorithm that significantly accelerates feature selection on large datasets by using multi-level approximations, achieving higher accuracy in less time.
Contribution
The paper presents a novel multi-stage surrogate-assisted framework and a specific algorithm, SAGA, which improves computational efficiency and solution quality in wrapper feature selection.
Findings
SAGA reduces computation time by three times compared to baseline.
SAGA converges to higher accuracy solutions.
Surrogate-assisted stage scales better with complex induction algorithms.
Abstract
Feature selection is an intractable problem, therefore practical algorithms often trade off the solution accuracy against the computation time. In this paper, we propose a novel multi-stage feature selection framework utilizing multiple levels of approximations, or surrogates. Such a framework allows for using wrapper approaches in a much more computationally efficient way, significantly increasing the quality of feature selection solutions achievable, especially on large datasets. We design and evaluate a Surrogate-Assisted Genetic Algorithm (SAGA) which utilizes this concept to guide the evolutionary search during the early phase of exploration. SAGA only switches to evaluating the original function at the final exploitation phase. We prove that the run-time upper bound of SAGA surrogate-assisted stage is at worse equal to the wrapper GA, and it scales better for induction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFeature Selection · SAGA · Genetic Algorithms
