# Cost-Sensitive Feature Selection by Optimizing F-Measures

**Authors:** Meng Liu, Chang Xu, Yong Luo, Chao Xu, Yonggang Wen, Dacheng Tao

arXiv: 1904.02301 · 2019-04-05

## TL;DR

This paper introduces a cost-sensitive feature selection method that optimizes F-measures to address class imbalance, leading to more representative feature subsets for imbalanced datasets.

## Contribution

It proposes a novel cost-sensitive feature selection algorithm based on F-measure optimization, explicitly handling class imbalance in high-dimensional data.

## Key findings

- Improved F-measure performance on benchmark datasets
- Effective handling of class imbalance in feature selection
- Validated on real-world imbalanced datasets

## Abstract

Feature selection is beneficial for improving the performance of general machine learning tasks by extracting an informative subset from the high-dimensional features. Conventional feature selection methods usually ignore the class imbalance problem, thus the selected features will be biased towards the majority class. Considering that F-measure is a more reasonable performance measure than accuracy for imbalanced data, this paper presents an effective feature selection algorithm that explores the class imbalance issue by optimizing F-measures. Since F-measure optimization can be decomposed into a series of cost-sensitive classification problems, we investigate the cost-sensitive feature selection by generating and assigning different costs to each class with rigorous theory guidance. After solving a series of cost-sensitive feature selection problems, features corresponding to the best F-measure will be selected. In this way, the selected features will fully represent the properties of all classes. Experimental results on popular benchmarks and challenging real-world data sets demonstrate the significance of cost-sensitive feature selection for the imbalanced data setting and validate the effectiveness of the proposed method.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.02301/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/1904.02301/full.md

## References

52 references — full list in the complete paper: https://tomesphere.com/paper/1904.02301/full.md

---
Source: https://tomesphere.com/paper/1904.02301