Contrastive Learning of Preferences with a Contextual InfoNCE Loss
Timo Bertram, Johannes F\"urnkranz, Martin M\"uller

TL;DR
This paper adapts the InfoNCE loss from the CLIP framework to better handle preference ranking problems involving multiple positive associations, demonstrating improved performance in card game data over traditional triplet loss methods.
Contribution
The authors introduce a novel adaptation of the InfoNCE loss for preference learning with multiple positives, addressing batch construction issues in the CLIP framework.
Findings
Adapted InfoNCE outperforms triplet loss in preference ranking tasks.
Vanilla CLIP performs poorly on multi-positive preference data.
The new method alleviates triplet mining problems.
Abstract
A common problem in contextual preference ranking is that a single preferred action is compared against several choices, thereby blowing up the complexity and skewing the preference distribution. In this work, we show how one can solve this problem via a suitable adaptation of the CLIP framework.This adaptation is not entirely straight-forward, because although the InfoNCE loss used by CLIP has achieved great success in computer vision and multi-modal domains, its batch-construction technique requires the ability to compare arbitrary items, and is not well-defined if one item has multiple positive associations in the same batch. We empirically demonstrate the utility of our adapted version of the InfoNCE loss in the domain of collectable card games, where we aim to learn an embedding space that captures the associations between single cards and whole card pools based on human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Multi-Criteria Decision Making · Logic, Reasoning, and Knowledge
MethodsSparse Evolutionary Training · Contrastive Language-Image Pre-training · InfoNCE
