Contrastive Variational Reinforcement Learning for Complex Observations

Xiao Ma; Siwei Chen; David Hsu; Wee Sun Lee

arXiv:2008.02430 · cs.LG · November 10, 2020 · 6 cites

TL;DR

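This paper introduces CVRL, a model-based contrastive variational reinforcement learning method for robot tasks with complex visual observations. It maximizes the mutual information between latent states and observations through contrastive learning instead of modeling the observation space generatively, which makes it more robust on visually complex tasks.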

Contribution

CVRL is a novel model-based DRL approach that uses contrastive learning to better manage complex visual inputs without modeling the entire observation space.

Findings

01 Outperforms state-of-the-art methods on Natural Mujoco tasks

02 Achieves superior results on robot box-pushing with complex observations

03 Demonstrates robustness to dynamic shadows and complex visual features

Abstract

Deep reinforcement learning (DRL) has achieved significant success in various robot tasks: manipulation, navigation, etc. However, complex visual observations in natural environments remain a major challenge. This paper presents Contrastive Variational Reinforcement Learning (CVRL), a model-based method that tackles complex visual observations in DRL. CVRL learns a contrastive variational model by maximizing the mutual information between latent states and observations discriminatively, through contrastive learning. It avoids modeling the complex observation space unnecessarily, as the commonly used generative observation model often does, and is significantly more robust. CVRL achieves performance comparable to state-of-the-art model-based DRL methods on standard Mujoco tasks. It significantly outperforms them on Natural Mujoco tasks and a robot box-pushing task with complex…
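
The contrastive objective described in the abstract can be illustrated with a generic InfoNCE-style loss. The sketch below is not taken from the paper or from the Yusufma03/CVRL repository: the function names, the dot-product scoring function, and the use of pre-computed observation embeddings are all assumptions made for illustration. It treats row i of the latent batch and row i of the observation batch as a positive pair and every other row in the batch as a negative.

```python
import numpy as np


def info_nce_loss(latents, obs_embeddings):
    """InfoNCE loss for a batch of paired latent states and observation embeddings.

    Both arguments have shape (batch, dim). Row i of each array forms a
    positive pair; all other rows act as negatives. The dot-product score
    is an illustrative assumption, not the paper's critic.
    """
    # scores[i, j] is the similarity between latent i and observation j.
    scores = latents @ obs_embeddings.T
    # Numerically stable log-softmax over each row.
    scores = scores - scores.max(axis=1, keepdims=True)
    log_probs = scores - np.log(np.exp(scores).sum(axis=1, keepdims=True))
    # Positive pairs sit on the diagonal.
    return -np.mean(np.diag(log_probs))


def mutual_information_lower_bound(latents, obs_embeddings):
    """InfoNCE lower bound on mutual information: log(batch) minus the loss."""
    batch_size = latents.shape[0]
    return np.log(batch_size) - info_nce_loss(latents, obs_embeddings)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    obs = rng.normal(size=(8, 16))
    aligned = 5.0 * obs                      # latents that match their own observation
    mismatched = np.roll(aligned, 1, axis=0)  # latents paired with the wrong observation
    print("aligned loss:   ", info_nce_loss(aligned, obs))
    print("mismatched loss:", info_nce_loss(mismatched, obs))
```

Minimizing this loss pushes each latent state to score its own observation above the other observations in the batch, so no pixel-level reconstruction of the observation is needed. In a full model-based agent the latents would come from a learned dynamics model and the embeddings from an image encoder; both are replaced here by random arrays.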

Code & Models

Repositories

Yusufma03/CVRL
tf · Official

Taxonomy

Topics: Reinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Anomaly Detection Techniques and Applications