Information Theoretic Perspective on Representation Learning
Deborah Pereg

TL;DR
This paper introduces an information-theoretic framework to analyze learned representations in regression tasks, defining concepts like representation-rate and capacity, and deriving fundamental limits on information encoding and compression.
Contribution
It presents a novel theoretical framework that quantifies the limits of representation learning using information theory, including capacity and rate-distortion bounds.
Findings
Derived limits on representation reliability based on input-source entropy
Defined and analyzed representation capacity in a perturbed setting
Established achievable bounds and unified the theoretical results
Abstract
An information-theoretic framework is introduced to analyze last-layer embedding, focusing on learned representations for regression tasks. We define representation-rate and derive limits on the reliability with which input-output information can be represented as is inherently determined by the input-source entropy. We further define representation capacity in a perturbed setting, and representation rate-distortion for a compressed output. We derive the achievable capacity, the achievable representation-rate, and their converse. Finally, we combine the results in a unified setting.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques · Human Pose and Action Recognition
