Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective
Quan Cui, Bingchen Zhao, Zhao-Min Chen, Borui Zhao, Renjie Song,, Jiajun Liang, Boyan Zhou, Osamu Yoshie

TL;DR
This paper analyzes the trade-off between discriminability and transferability in deep representations for image classification, revealing that over-compression hampers transferability and proposing a contrastive learning framework to mitigate this issue.
Contribution
It introduces the contrastive temporal coding (CTC) framework that alleviates the discriminability-transferability trade-off by addressing over-compression, validated through extensive experiments.
Findings
CTC improves transferability without sacrificing discriminability.
Over-compression of input information causes the trade-off.
Empirical results show notable gains on classification and transfer tasks.
Abstract
This work simultaneously considers the discriminability and transferability properties of deep representations in the typical supervised learning task, i.e., image classification. By a comprehensive temporal analysis, we observe a trade-off between these two properties. The discriminability keeps increasing with the training progressing while the transferability intensely diminishes in the later training period. From the perspective of information-bottleneck theory, we reveal that the incompatibility between discriminability and transferability is attributed to the over-compression of input information. More importantly, we investigate why and how the InfoNCE loss can alleviate the over-compression, and further present a learning framework, named contrastive temporal coding~(CTC), to counteract the over-compression and alleviate the incompatibility. Extensive experiments validate that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Image and Signal Denoising Methods · Generative Adversarial Networks and Image Synthesis
MethodsInfoNCE
