Temperature-Free Loss Function for Contrastive Learning
Bum Jun Kim, Sang Woo Kim

TL;DR
This paper introduces a temperature-free variant of the InfoNCE loss for contrastive learning, eliminating the need for hyperparameter tuning and improving gradient properties, with validated performance gains across multiple benchmarks.
Contribution
The paper proposes a novel temperature-free InfoNCE loss using inverse hyperbolic tangent, addressing hyperparameter sensitivity and gradient issues in contrastive learning.
Findings
Achieves comparable or better performance without temperature tuning
Provides better gradient properties for stable training
Validated on five benchmark datasets
Abstract
As one of the most promising methods in self-supervised learning, contrastive learning has achieved a series of breakthroughs across numerous fields. A predominant approach to implementing contrastive learning is applying InfoNCE loss: By capturing the similarities between pairs, InfoNCE loss enables learning the representation of data. Albeit its success, adopting InfoNCE loss requires tuning a temperature, which is a core hyperparameter for calibrating similarity scores. Despite its significance and sensitivity to performance being emphasized by several studies, searching for a valid temperature requires extensive trial-and-error-based experiments, which increases the difficulty of adopting InfoNCE loss. To address this difficulty, we propose a novel method to deploy InfoNCE loss without temperature. Specifically, we replace temperature scaling with the inverse hyperbolic tangent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsContrastive Learning · InfoNCE
