Adjusted Measures for Feature Selection Stability for Data Sets with Similar Features

Andrea Bommert, Jörg Rahnenführer

arXiv:2009.12075·stat.ML·January 18, 2021

TL;DR

This paper introduces new adjusted stability measures for feature selection that take similarities between features into account, for example high correlations, and that overcome the theoretical drawbacks of existing adjusted measures.

Contribution

The authors propose novel adjusted stability measures that account for feature similarities, overcoming theoretical drawbacks of previous methods.

Findings

01

New stability measure effectively treats highly similar features as exchangeable.

02

The proposed measures are compared with each other and with existing stability measures on both artificial and real sets of selected features.

03

Recommended measure improves feature stability evaluation in correlated feature scenarios.

Abstract

For data sets with similar features, for example highly correlated features, most existing stability measures behave in an undesired way: They consider features that are almost identical but have different identifiers as different features. Existing adjusted stability measures, that is, stability measures that take into account the similarities between features, have major theoretical drawbacks. We introduce new adjusted stability measures that overcome these drawbacks. We compare them to each other and to existing stability measures based on both artificial and real sets of selected features. Based on the results, we suggest using one new stability measure that considers highly similar features as exchangeable.
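The undesired behavior described in the abstract can be illustrated with a small sketch. It is not one of the measures proposed in the paper: it uses the mean pairwise Jaccard index as a stand-in for an unadjusted stability measure, and a hypothetical `group_of` mapping (an assumption made here for illustration) that replaces each feature by the label of its group of highly similar features. The feature names and the helper `collapse_similar` are likewise invented for this example.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard index of two feature sets (defined as 1.0 for two empty sets)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def mean_pairwise_jaccard(feature_sets):
    """Average Jaccard index over all pairs of selected feature sets."""
    pairs = list(combinations(feature_sets, 2))
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


def collapse_similar(feature_sets, group_of):
    """Replace each feature by its similarity-group label (hypothetical helper).

    Features missing from `group_of` are treated as their own group.
    """
    return [{group_of.get(f, f) for f in s} for s in feature_sets]


# Assumed toy data: "x1" and "x1_copy" are almost identical features
# that carry different identifiers.
selections = [{"x1", "x2"}, {"x1_copy", "x2"}, {"x1", "x2"}]
group_of = {"x1": "g1", "x1_copy": "g1"}

# Unadjusted: the near-duplicate is counted as a different feature.
unadjusted = mean_pairwise_jaccard(selections)

# Illustrative adjustment: similar features are treated as exchangeable.
adjusted = mean_pairwise_jaccard(collapse_similar(selections, group_of))

print(f"unadjusted: {unadjusted:.3f}, adjusted: {adjusted:.3f}")
```

In this toy setting the unadjusted score drops only because one selection picked `x1_copy` instead of `x1`, while the grouped variant rates all three selections as identical. The measures in the paper are defined differently; the sketch only shows why treating highly similar features as exchangeable changes the assessment.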

Code & Models

Repositories

bommert/adjusted-stability-measures (official)