The Impact of Negative Sampling on Contrastive Structured World Models

Ondrej Biza; Elise van der Pol; Thomas Kipf

arXiv:2107.11676·cs.LG·July 27, 2021

The Impact of Negative Sampling on Contrastive Structured World Models

Ondrej Biza, Elise van der Pol, Thomas Kipf

PDF

Open Access 1 Repo

TL;DR

This paper investigates how different negative sampling strategies in contrastive learning significantly affect the performance of structured world models, demonstrating improvements through leveraging temporal correlations and dataset diversity.

Contribution

It reveals the critical impact of negative sampling choices on contrastive world models and introduces methods to enhance performance by exploiting temporal correlations.

Findings

01

Leveraging time step correlations doubles model performance.

02

Negative sampling strategies drastically influence contrastive learning outcomes.

03

Diverse datasets enable more robust contrastive world models.

Abstract

World models trained by contrastive learning are a compelling alternative to autoencoder-based world models, which learn by reconstructing pixel states. In this paper, we describe three cases where small changes in how we sample negative states in the contrastive loss lead to drastic changes in model performance. In previously studied Atari datasets, we show that leveraging time step correlations can double the performance of the Contrastive Structured World Model. We also collect a full version of the datasets to study contrastive learning under a more diverse set of experiences.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ondrejba/negative-sampling-icml-21
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Topic Modeling

MethodsContrastive Learning