Optimal Compression for Minimizing Classification Error Probability: an Information-Theoretic Approach
Jingchao Gao, Ao Tang, Weiyu Xu

TL;DR
This paper develops an information-theoretic framework for optimal data compression that minimizes classification error, providing analytical and computational methods to balance compression efficiency and accuracy.
Contribution
It introduces a novel approach to optimize data compression for classification tasks by minimizing mutual information under error constraints, surpassing existing information-bottleneck methods.
Findings
Analytical solution for binary label data compression.
Convex optimization formulation for multi-label data.
Improved classification performance over existing methods.
Abstract
We formulate the problem of performing optimal data compression under the constraints that compressed data can be used for accurate classification in machine learning. We show that this translates to a problem of minimizing the mutual information between data and its compressed version under the constraint on error probability of classification is small when using the compressed data for machine learning. We then provide analytical and computational methods to characterize the optimal trade-off between data compression and classification error probability. First, we provide an analytical characterization for the optimal compression strategy for data with binary labels. Second, for data with multiple labels, we formulate a set of convex optimization problems to characterize the optimal tradeoff, from which the optimal trade-off between the classification error and compression efficiency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Wireless Communication Security Techniques · Distributed Sensor Networks and Detection Algorithms
