Contrastive Unsupervised Learning of World Model with Invariant Causal Features
Rudra P.K. Poudel, Harit Pandya, Roberto Cipolla

TL;DR
This paper introduces a contrastive unsupervised learning approach to develop a world model that captures invariant causal features, improving out-of-distribution generalization and sim-to-real transfer in reinforcement learning tasks.
Contribution
It proposes a novel intervention invariant auxiliary task using depth prediction and data augmentation to learn invariant causal features in an unsupervised manner.
Findings
Outperforms state-of-the-art in out-of-distribution point navigation
Enables effective sim-to-real transfer of perception modules
Performs comparably on DeepMind control suite without explicit depth
Abstract
In this paper we present a world model, which learns causal features using the invariance principle. In particular, we use contrastive unsupervised learning to learn the invariant causal features, which enforces invariance across augmentations of irrelevant parts or styles of the observation. The world-model-based reinforcement learning methods independently optimize representation learning and the policy. Thus naive contrastive loss implementation collapses due to a lack of supervisory signals to the representation learning module. We propose an intervention invariant auxiliary task to mitigate this issue. Specifically, we utilize depth prediction to explicitly enforce the invariance and use data augmentation as style intervention on the RGB observation space. Our design leverages unsupervised representation learning to learn the world model with invariant causal features. Our proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
