Sufficient Dimensionality Reduction with Irrelevant Statistics
Amir Globerson, Gal Chechik, Naftali Tishby

TL;DR
This paper introduces a method for extracting reduced-dimensional features from data that retain maximum relevant information while minimizing irrelevant information, using side data to distinguish between relevant and irrelevant features.
Contribution
It extends the Sufficient Dimensionality Reduction framework by incorporating side information to separate relevant from irrelevant features through a constrained optimization approach.
Findings
Developed a gradient descent algorithm based on maximum entropy principles.
Formulated the feature extraction as a tradeoff between relevance and irrelevance.
Provided theoretical characterization of the solutions to the optimization problem.
Abstract
The problem of finding a reduced dimensionality representation of categorical variables while preserving their most relevant characteristics is fundamental for the analysis of complex data. Specifically, given a co-occurrence matrix of two variables, one often seeks a compact representation of one variable which preserves information about the other variable. We have recently introduced ``Sufficient Dimensionality Reduction' [GT-2003], a method that extracts continuous reduced dimensional features whose measurements (i.e., expectation values) capture maximal mutual information among the variables. However, such measurements often capture information that is irrelevant for a given task. Widely known examples are illumination conditions, which are irrelevant as features for face recognition, writing style which is irrelevant as a feature for content classification, and intonation which is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Sparse and Compressive Sensing Techniques · Remote-Sensing Image Classification
