Simple and Effective Balance of Contrastive Losses

Arnaud Sors; Rafael Sampaio de Rezende; Sarah Ibrahimi and; Jean-Marc Andreoli

arXiv:2112.11743·cs.LG·December 23, 2021

Simple and Effective Balance of Contrastive Losses

Arnaud Sors, Rafael Sampaio de Rezende, Sarah Ibrahimi and, Jean-Marc Andreoli

PDF

Open Access

TL;DR

This paper presents a hyper-parameter optimization method for balancing contrastive losses in deep metric and self-supervised learning, improving performance and efficiency.

Contribution

It introduces a coordinate descent-based search for optimal contrastive loss hyper-parameters, including batch size, extending existing analyses and demonstrating faster convergence.

Findings

01

Faster hyper-parameter tuning compared to other methods

02

Effective balance of contrastive loss components improves performance

03

Applicable to various benchmarks in metric and self-supervised learning

Abstract

Contrastive losses have long been a key ingredient of deep metric learning and are now becoming more popular due to the success of self-supervised learning. Recent research has shown the benefit of decomposing such losses into two sub-losses which act in a complementary way when learning the representation network: a positive term and an entropy term. Although the overall loss is thus defined as a combination of two terms, the balance of these two terms is often hidden behind implementation details and is largely ignored and sub-optimal in practice. In this work, we approach the balance of contrastive losses as a hyper-parameter optimization problem, and propose a coordinate descent-based search method that efficiently find the hyper-parameters that optimize evaluation performance. In the process, we extend existing balance analyses to the contrastive margin loss, include batch size in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Face recognition and analysis