A Supervised Feature Selection Method For Mixed-Type Data using Density-based Feature Clustering
Xuyang Yan, Mrinmoy Sarkar, Biniam Gebru, Shabnam Nazmi, and Abdollah, Homaifar

TL;DR
This paper introduces a supervised feature selection method for mixed-type data that uses density-based clustering to reduce redundancy and improve classification efficiency.
Contribution
It proposes a novel density-based clustering approach for feature selection in mixed-type data, addressing redundancy and relevance issues.
Findings
Outperforms five state-of-the-art methods on thirteen datasets.
Effectively reduces feature redundancy while maintaining classification accuracy.
Demonstrates robustness across diverse real-world datasets.
Abstract
Feature selection methods are widely used to address the high computational overheads and curse of dimensionality in classifying high-dimensional data. Most conventional feature selection methods focus on handling homogeneous features, while real-world datasets usually have a mixture of continuous and discrete features. Some recent mixed-type feature selection studies only select features with high relevance to class labels and ignore the redundancy among features. The determination of an appropriate feature subset is also a challenge. In this paper, a supervised feature selection method using density-based feature clustering (SFSDFC) is proposed to obtain an appropriate final feature subset for mixed-type data. SFSDFC decomposes the feature space into a set of disjoint feature clusters using a novel density-based clustering method. Then, an effective feature selection strategy is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition
MethodsFeature Selection
