TL;DR
This paper introduces inverse contrastive loss (ICL), a novel approach for learning invariant representations by optimizing a loss related to the MMD divergence, improving invariance to extraneous variables.
Contribution
The paper proposes ICL, a new loss function that enables learning invariant representations by connecting contrastive losses with divergence measures, applicable to both binary and general extraneous variables.
Findings
ICL optimization is equivalent to regularized MMD divergence for binary variables.
Models trained with ICL show improved invariance to extraneous variables.
Applicable to both continuous and discrete extraneous variables in various settings.
Abstract
Learning invariant representations is a critical first step in a number of machine learning tasks. A common approach corresponds to the so-called information bottleneck principle in which an application dependent function of mutual information is carefully chosen and optimized. Unfortunately, in practice, these functions are not suitable for optimization purposes since these losses are agnostic of the metric structure of the parameters of the model. We introduce a class of losses for learning representations that are invariant to some extraneous variable of interest by inverting the class of contrastive losses, i.e., inverse contrastive loss (ICL). We show that if the extraneous variable is binary, then optimizing ICL is equivalent to optimizing a regularized MMD divergence. More generally, we also show that if we are provided a metric on the sample space, our formulation of ICL can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
