A Semidefinite Programming Based Search Strategy for Feature Selection with Mutual Information Measure
Tofigh Naghibi, Sarah Hoffmann, Beat Pfister

TL;DR
This paper introduces a novel semidefinite programming-based search strategy for feature selection using mutual information, providing a polynomial-time approximation method that outperforms traditional heuristics.
Contribution
It proposes a new parallel search algorithm based on SDP for feature selection, linking it to the maximum-cut problem and deriving its approximation ratio.
Findings
The SDP-based method efficiently searches the feature subset space in polynomial time.
Most heuristic criteria are shown to be truncated versions of mutual information expansions.
The proposed algorithm's approximation ratio is comparable to established methods.
Abstract
Feature subset selection, as a special case of the general subset selection problem, has been the topic of a considerable number of studies due to the growing importance of data-mining applications. In the feature subset selection problem there are two main issues that need to be addressed: (i) Finding an appropriate measure function than can be fairly fast and robustly computed for high-dimensional data. (ii) A search strategy to optimize the measure over the subset space in a reasonable amount of time. In this article mutual information between features and class labels is considered to be the measure function. Two series expansions for mutual information are proposed, and it is shown that most heuristic criteria suggested in the literature are truncated approximations of these expansions. It is well-known that searching the whole subset space is an NP-hard problem. Here, instead of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Sparse and Compressive Sensing Techniques · Face and Expression Recognition
