Intrinsically Motivated Self-supervised Learning in Reinforcement   Learning

Yue Zhao; Chenzhuang Du; Hang Zhao; Tiejun Li

arXiv:2106.13970·cs.LG·May 16, 2022

Intrinsically Motivated Self-supervised Learning in Reinforcement Learning

Yue Zhao, Chenzhuang Du, Hang Zhao, Tiejun Li

PDF

Open Access

TL;DR

This paper introduces IM-SSR, a method that uses self-supervised loss as an intrinsic reward in reinforcement learning, enhancing exploration, robustness, and sample efficiency in vision-based robotics tasks.

Contribution

It proposes a novel way to utilize self-supervised loss as an intrinsic reward, improving reinforcement learning performance without significant additional costs.

Findings

01

Enhanced sample efficiency in vision-based RL tasks.

02

Improved generalization in robotics tasks.

03

Salient performance gains with sparse rewards.

Abstract

In vision-based reinforcement learning (RL) tasks, it is prevalent to assign auxiliary tasks with a surrogate self-supervised loss so as to obtain more semantic representations and improve sample efficiency. However, abundant information in self-supervised auxiliary tasks has been disregarded, since the representation learning part and the decision-making part are separated. To sufficiently utilize information in auxiliary tasks, we present a simple yet effective idea to employ self-supervised loss as an intrinsic reward, called Intrinsically Motivated Self-Supervised learning in Reinforcement learning (IM-SSR). We formally show that the self-supervised loss can be decomposed as exploration for novel states and robustness improvement from nuisance elimination. IM-SSR can be effortlessly plugged into any reinforcement learning with self-supervised auxiliary objectives with nearly no…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neuroscience and Neural Engineering · Advanced Fluorescence Microscopy Techniques