Automatically Redundant Features Removal for Unsupervised Feature Selection via Sparse Feature Graph
Shuchu Han, Hao Huang, Hong Qin

TL;DR
This paper introduces a graph-based method called Sparse Feature Graph (SFG) for automatically identifying and removing redundant features in high-dimensional datasets, thereby enhancing the performance of unsupervised learning algorithms.
Contribution
It proposes a novel SFG approach that models both pairwise and group feature redundancies within an unsupervised feature selection framework.
Findings
SFG effectively models feature redundancy and group redundancy.
Removing redundant features improves unsupervised feature selection performance.
Experimental results show consistent performance gains on benchmark datasets.
Abstract
The redundant features existing in high dimensional datasets always affect the performance of learning and mining algorithms. How to detect and remove them is an important research topic in machine learning and data mining research. In this paper, we propose a graph based approach to find and remove those redundant features automatically for high dimensional data. Based on the sparse learning based unsupervised feature selection framework, Sparse Feature Graph (SFG) is introduced not only to model the redundancy between two features, but also to disclose the group redundancy between two groups of features. With SFG, we can divide the whole features into different groups, and improve the intrinsic structure of data by removing detected redundant features. With accurate data structure, quality indicator vectors can be obtained to improve the learning performance of existing unsupervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Machine Learning and Data Classification · Advanced Image and Video Retrieval Techniques
