A Study on Dense and Sparse (Visual) Rewards in Robot Policy Learning

Abdalkarim Mohtasib; Gerhard Neumann; Heriberto Cuayahuitl

arXiv:2108.03222·cs.RO·August 9, 2021

A Study on Dense and Sparse (Visual) Rewards in Robot Policy Learning

Abdalkarim Mohtasib, Gerhard Neumann, Heriberto Cuayahuitl

PDF

TL;DR

This paper investigates how different types of visual rewards affect the performance of deep reinforcement learning in robotic tasks, highlighting the advantages of dense over sparse visual rewards and the variability across algorithms.

Contribution

It provides a comparative analysis of state-of-the-art DRL algorithms using various visual reward schemes in simulated robotic tasks.

Findings

01

Visual dense rewards outperform visual sparse rewards.

02

Performance depends on task visibility and reward type.

03

No single algorithm is best for all tasks.

Abstract

Deep Reinforcement Learning (DRL) is a promising approach for teaching robots new behaviour. However, one of its main limitations is the need for carefully hand-coded reward signals by an expert. We argue that it is crucial to automate the reward learning process so that new skills can be taught to robots by their users. To address such automation, we consider task success classifiers using visual observations to estimate the rewards in terms of task success. In this work, we study the performance of multiple state-of-the-art deep reinforcement learning algorithms under different types of reward: Dense, Sparse, Visual Dense, and Visual Sparse rewards. Our experiments in various simulation tasks (Pendulum, Reacher, Pusher, and Fetch Reach) show that while DRL agents can learn successful behaviours using visual rewards when the goal targets are distinguishable, their performance may…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.