Duality between Feature Selection and Data Clustering
Chung Chan, Ali Al-Bashabsheh, Qiaoqiao Zhou, Tie Liu

TL;DR
This paper reveals a fundamental duality between feature selection and data clustering by formulating feature selection through an information-theoretic lens and extending info-clustering methods.
Contribution
It introduces a novel information-theoretic framework that connects feature selection with data clustering via an extension of info-clustering.
Findings
Feature selection can be efficiently solved using info-clustering techniques.
A fundamental duality exists between feature selection and data clustering.
The duality stems from the principal partition and lattice of partitions in combinatorial optimization.
Abstract
The feature-selection problem is formulated from an information-theoretic perspective. We show that the problem can be efficiently solved by an extension of the recently proposed info-clustering paradigm. This reveals the fundamental duality between feature selection and data clustering,which is a consequence of the more general duality between the principal partition and the principal lattice of partitions in combinatorial optimization.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRough Sets and Fuzzy Logic · Advanced Clustering Algorithms Research · Data Management and Algorithms
