Implications on Feature Detection when using the Benefit-Cost Ratio

Rudolf Jagdhuber; J\"org Rahnenf\"uhrer

arXiv:2008.05163·stat.ML·August 18, 2020

Implications on Feature Detection when using the Benefit-Cost Ratio

Rudolf Jagdhuber, J\"org Rahnenf\"uhrer

PDF

TL;DR

This paper investigates how using the benefit-cost ratio in feature selection affects the ability to identify relevant features, revealing potential pitfalls when cost differences are large or effect sizes are small.

Contribution

It provides a simulation-based analysis of the implications of the benefit-cost ratio in feature detection, highlighting risks and proposing mitigation strategies.

Findings

01

Large cost differences reduce relevant feature detection

02

Benefit-cost ratio can favor noise features over relevant ones

03

Rescaling costs or using hyperparameters can mitigate issues

Abstract

In many practical machine learning applications, there are two objectives: one is to maximize predictive accuracy and the other is to minimize costs of the resulting model. These costs of individual features may be financial costs, but can also refer to other aspects, like for example evaluation time. Feature selection addresses both objectives, as it reduces the number of features and can improve the generalization ability of the model. If costs differ between features, the feature selection needs to trade-off the individual benefit and cost of each feature. A popular trade-off choice is the ratio of both, the BCR (benefit-cost ratio). In this paper we analyze implications of using this measure with special focus to the ability to distinguish relevant features from noise. We perform a simulation study for different cost and data settings and obtain detection rates of relevant features…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFeature Selection