Compression-Based Regularization with an Application to Multi-Task Learning
Mat\'ias Vera, Leonardo Rey Vega, Pablo Piantanida

TL;DR
This paper introduces an information theoretic regularization approach for multi-task learning that balances model complexity and risk, demonstrating an optimal tradeoff with empirical and hierarchical text categorization applications.
Contribution
It formulates multi-task learning within an information theoretic framework using lossy source coding principles, providing an iterative algorithm with convergence guarantees.
Findings
Existence of an optimal information rate minimizing excess risk.
Algorithm effectively balances risk and model complexity.
Application to hierarchical text categorization shows practical benefits.
Abstract
This paper investigates, from information theoretic grounds, a learning problem based on the principle that any regularity in a given dataset can be exploited to extract compact features from data, i.e., using fewer bits than needed to fully describe the data itself, in order to build meaningful representations of a relevant content (multiple labels). We begin by introducing the noisy lossy source coding paradigm with the log-loss fidelity criterion which provides the fundamental tradeoffs between the \emph{cross-entropy loss} (average risk) and the information rate of the features (model complexity). Our approach allows an information theoretic formulation of the \emph{multi-task learning} (MTL) problem which is a supervised learning framework in which the prediction models for several related tasks are learned jointly from common representations to achieve better generalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Data Compression Techniques · Face and Expression Recognition
