TL;DR
This paper introduces MMINet, a neural network that uses stochastic mutual information gradient estimation for end-to-end dimensionality reduction, aiming to maximize class-related information without distributional assumptions.
Contribution
It presents a novel neural network training method based on stochastic mutual information gradient estimation for feature transformation in supervised dimensionality reduction.
Findings
Effective on high-dimensional biological data
Outperforms traditional feature selection methods
No distributional assumptions needed
Abstract
Feature ranking and selection is a widely used approach in various applications of supervised dimensionality reduction in discriminative machine learning. Nevertheless there exists significant evidence on feature ranking and selection algorithms based on any criterion leading to potentially sub-optimal solutions for class separability. In that regard, we introduce emerging information theoretic feature transformation protocols as an end-to-end neural network training approach. We present a dimensionality reduction network (MMINet) training procedure based on the stochastic estimate of the mutual information gradient. The network projects high-dimensional features onto an output feature space where lower dimensional representations of features carry maximum mutual information with their associated class labels. Furthermore, we formulate the training objective to be estimated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFeature Selection
