Information Theoretic Perspective on Representation Learning

Deborah Pereg

arXiv:2601.11334·cs.IT·January 19, 2026

Open Access

TL;DR

This paper introduces an information-theoretic framework to analyze learned representations in regression tasks, defining concepts like representation-rate and capacity, and deriving fundamental limits on information encoding and compression.

Contribution

The paper presents a novel theoretical framework that uses information theory to quantify the limits of representation learning, including capacity and rate-distortion bounds.

Findings

01. Derived limits on representation reliability based on input-source entropy

02. Defined and analyzed representation capacity in a perturbed setting

03. Established achievable bounds and unified the theoretical results

Abstract

An information-theoretic framework is introduced to analyze last-layer embeddings, focusing on learned representations for regression tasks. We define the representation-rate and derive limits on the reliability with which input-output information can be represented, limits that are inherently determined by the input-source entropy. We further define representation capacity in a perturbed setting, and representation rate-distortion for a compressed output. We derive the achievable capacity, the achievable representation-rate, and their converses. Finally, we combine the results in a unified setting.

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques · Human Pose and Action Recognition