Pareto-optimal data compression for binary classification tasks
Max Tegmark (MIT), Tailin Wu (MIT)

TL;DR
This paper introduces a method to map data into a compressed representation that optimally balances information retention about a class label and entropy, specifically for binary classification, using Pareto frontier analysis.
Contribution
It presents a novel approach to visualize and compute the Pareto frontier for data compression in classification tasks, including a lossless reduction to a real-valued variable and a binning strategy for binary cases.
Findings
Efficiently maps data to a real-valued variable preserving all class information.
Provides a method to sweep the Pareto frontier by binning the real-valued variable.
Demonstrates the approach on CIFAR-10, MNIST, and Fashion-MNIST datasets.
Abstract
The goal of lossy data compression is to reduce the storage cost of a data set while retaining as much information as possible about something () that you care about. For example, what aspects of an image contain the most information about whether it depicts a cat? Mathematically, this corresponds to finding a mapping that maximizes the mutual information while the entropy is kept below some fixed threshold. We present a method for mapping out the Pareto frontier for classification tasks, reflecting the tradeoff between retained entropy and class information. We first show how a random variable (an image, say) drawn from a class can be distilled into a vector losslessly, so that ; for example, for a binary classification task of cats and dogs, each image is mapped into a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Machine Learning and Algorithms · Evolutionary Algorithms and Applications
