Contrastive Unsupervised Learning of World Model with Invariant Causal   Features

Rudra P.K. Poudel; Harit Pandya; Roberto Cipolla

arXiv:2209.14932·cs.LG·September 30, 2022

Contrastive Unsupervised Learning of World Model with Invariant Causal Features

Rudra P.K. Poudel, Harit Pandya, Roberto Cipolla

PDF

Open Access

TL;DR

This paper introduces a contrastive unsupervised learning approach to develop a world model that captures invariant causal features, improving out-of-distribution generalization and sim-to-real transfer in reinforcement learning tasks.

Contribution

It proposes a novel intervention invariant auxiliary task using depth prediction and data augmentation to learn invariant causal features in an unsupervised manner.

Findings

01

Outperforms state-of-the-art in out-of-distribution point navigation

02

Enables effective sim-to-real transfer of perception modules

03

Performs comparably on DeepMind control suite without explicit depth

Abstract

In this paper we present a world model, which learns causal features using the invariance principle. In particular, we use contrastive unsupervised learning to learn the invariant causal features, which enforces invariance across augmentations of irrelevant parts or styles of the observation. The world-model-based reinforcement learning methods independently optimize representation learning and the policy. Thus naive contrastive loss implementation collapses due to a lack of supervisory signals to the representation learning module. We propose an intervention invariant auxiliary task to mitigate this issue. Specifically, we utilize depth prediction to explicitly enforce the invariance and use data augmentation as style intervention on the RGB observation space. Our design leverages unsupervised representation learning to learn the world model with invariant causal features. Our proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications