Feature Selection for multi-labeled variables via Dependency   Maximization

Salimeh Yasaei Sekeh; Alfred O. Hero

arXiv:1902.03544·stat.ML·May 20, 2019·1 cites

Feature Selection for multi-labeled variables via Dependency Maximization

Salimeh Yasaei Sekeh, Alfred O. Hero

PDF

Open Access

TL;DR

This paper introduces a novel feature selection criterion based on dependency maximization using a geometric measure, improving efficiency and effectiveness for multi-labeled classification tasks.

Contribution

It proposes a new non-parametric feature selection method utilizing a dependency measure derived from the Friedman-Rafsky test, suitable for multi-class classification.

Findings

01

Efficient and fast feature selection implementation.

02

Demonstrated advantages through simulation studies.

03

Successfully applied to MNIST dataset.

Abstract

Feature selection and reducing the dimensionality of data is an essential step in data analysis. In this work, we propose a new criterion for feature selection that is formulated as conditional information between features given the labeled variable. Instead of using the standard mutual information measure based on Kullback-Leibler divergence, we use our proposed criterion to filter out redundant features for the purpose of multiclass classification. This approach results in an efficient and fast non-parametric implementation of feature selection as it can be directly estimated using a geometric measure of dependency, the global Friedman-Rafsky (FR) multivariate run test statistic constructed by a global minimal spanning tree (MST). We demonstrate the advantages of our proposed feature selection approach through simulation. In addition the proposed feature selection method is applied to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Bayesian Modeling and Causal Inference