A novel evaluation methodology for supervised Feature Ranking algorithms
Jeroen G. S. Overschie

TL;DR
This paper introduces a new systematic evaluation methodology for supervised feature ranking algorithms using synthetic datasets, along with an open-source benchmarking framework called fseval for large-scale experiments.
Contribution
It proposes a novel evaluation methodology for feature rankers and provides an open-source benchmarking framework to facilitate systematic and large-scale assessments.
Findings
Identified strengths and weaknesses of various feature ranking algorithms
Demonstrated the effectiveness of the new evaluation methodology
Enabled large-scale, parallel experimentation with the fseval framework
Abstract
Both in the domains of Feature Selection and Interpretable AI, there exists a desire to `rank' features based on their importance. Such feature importance rankings can then be used to either: (1) reduce the dataset size or (2) interpret the Machine Learning model. In the literature, however, such Feature Rankers are not evaluated in a systematic, consistent way. Many papers have a different way of arguing which feature importance ranker works best. This paper fills this gap, by proposing a new evaluation methodology. By making use of synthetic datasets, feature importance scores can be known beforehand, allowing more systematic evaluation. To facilitate large-scale experimentation using the new methodology, a benchmarking framework was built in Python, called fseval. The framework allows running experiments in parallel and distributed over machines on HPC systems. By integrating with an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Machine Learning and Data Classification · Neural Networks and Applications
MethodsFeature Selection
