Cost-constrained multi-label group feature selection using shadow features
Tomasz Klonecki, Pawe{\l} Teisseyre, Jaesung Lee

TL;DR
This paper introduces a cost-aware multi-label feature selection method using shadow features, optimizing the selection process based on information theory to reduce diagnostic test costs in medical applications.
Contribution
It proposes a novel two-step feature selection approach that incorporates shadow features to control feature addition without complex penalty optimization.
Findings
Effective in medical datasets like MIMIC
Performs well under limited budget constraints
Reduces diagnostic test costs while maintaining accuracy
Abstract
We consider the problem of feature selection in multi-label classification, considering the costs assigned to groups of features. In this task, the goal is to select a subset of features that will be useful for predicting the label vector, but at the same time, the cost associated with the selected features will not exceed the assumed budget. Solving the problem is of great importance in medicine, where we may be interested in predicting various diseases based on groups of features. The groups may be associated with parameters obtained from a certain diagnostic test, such as a blood test. Because diagnostic test costs can be very high, considering cost information when selecting relevant features becomes crucial to reducing the cost of making predictions. We focus on the feature selection method based on information theory. The proposed method consists of two steps. First, we select…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Image Retrieval and Classification Techniques · Face and Expression Recognition
MethodsFeature Selection · Focus
