Tackling Visual Control via Multi-View Exploration Maximization

Mingqi Yuan; Xin Jin; Bo Li; Wenjun Zeng

arXiv:2211.15233·cs.LG·November 29, 2022

Tackling Visual Control via Multi-View Exploration Maximization

Mingqi Yuan, Xin Jin, Bo Li, Wenjun Zeng

PDF

Open Access

TL;DR

MEM introduces a novel multi-view reinforcement learning approach that combines representation learning and intrinsic reward-driven exploration, significantly improving sample efficiency and generalization in complex visual control tasks.

Contribution

It is the first method to integrate multi-view representation learning with entropy-based exploration rewards in RL.

Findings

01

MEM outperforms existing methods on DeepMind Control Suite tasks.

02

MEM demonstrates higher sample efficiency and better generalization.

03

The approach is effective in high-dimensional, sparse-reward environments.

Abstract

We present MEM: Multi-view Exploration Maximization for tackling complex visual control tasks. To the best of our knowledge, MEM is the first approach that combines multi-view representation learning and intrinsic reward-driven exploration in reinforcement learning (RL). More specifically, MEM first extracts the specific and shared information of multi-view observations to form high-quality features before performing RL on the learned features, enabling the agent to fully comprehend the environment and yield better actions. Furthermore, MEM transforms the multi-view features into intrinsic rewards based on entropy maximization to encourage exploration. As a result, MEM can significantly promote the sample-efficiency and generalization ability of the RL agent, facilitating solving real-world problems with high-dimensional observations and spare-reward space. We evaluate MEM on various…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics