Optimised Feature Subset Selection via Simulated Annealing

Fernando Mart\'inez-Garc\'ia; \'Alvaro Rubio-Garc\'ia; Samuel Fern\'andez-Lorenzo; Juan Jos\'e Garc\'ia-Ripoll; Diego Porras

arXiv:2507.23568·cs.LG·August 1, 2025

Optimised Feature Subset Selection via Simulated Annealing

Fernando Mart\'inez-Garc\'ia, \'Alvaro Rubio-Garc\'ia, Samuel Fern\'andez-Lorenzo, Juan Jos\'e Garc\'ia-Ripoll, Diego Porras

PDF

Open Access

TL;DR

This paper presents SA-FDR, a simulated annealing-based algorithm for optimal feature subset selection that balances model simplicity and accuracy in high-dimensional classification tasks.

Contribution

SA-FDR introduces a global search approach for $\, ext{l}_0$-norm feature selection using simulated annealing guided by Fisher discriminant ratio, improving over greedy methods.

Findings

01

SA-FDR selects more compact feature subsets with high accuracy.

02

It effectively captures inter-feature dependencies missed by greedy methods.

03

Demonstrates scalability on large datasets with hundreds of thousands of samples.

Abstract

We introduce SA-FDR, a novel algorithm for $ℓ_{0}$ -norm feature selection that considers this task as a combinatorial optimisation problem and solves it by using simulated annealing to perform a global search over the space of feature subsets. The optimisation is guided by the Fisher discriminant ratio, which we use as a computationally efficient proxy for model quality in classification tasks. Our experiments, conducted on datasets with up to hundreds of thousands of samples and hundreds of features, demonstrate that SA-FDR consistently selects more compact feature subsets while achieving a high predictive accuracy. This ability to recover informative yet minimal sets of features stems from its capacity to capture inter-feature dependencies often missed by greedy optimisation approaches. As a result, SA-FDR provides a flexible and effective solution for designing interpretable models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition