Attention-Privileged Reinforcement Learning

Sasha Salter; Dushyant Rao; Markus Wulfmeier; Raia Hadsell; Ingmar; Posner

arXiv:1911.08363·cs.AI·January 12, 2021·6 cites

Attention-Privileged Reinforcement Learning

Sasha Salter, Dushyant Rao, Markus Wulfmeier, Raia Hadsell, Ingmar, Posner

PDF

Open Access

TL;DR

This paper introduces APRiL, a reinforcement learning method that uses self-supervised attention to focus on task-relevant features, improving sample efficiency and robustness to distractors without needing privileged information during deployment.

Contribution

We propose a novel attention-augmented reinforcement learning framework that leverages privileged information during training to enhance learning efficiency and robustness, applicable to unseen environments.

Findings

01

Accelerated learning in diverse domains

02

Enhanced robustness to distractors

03

Improved performance outside training distribution

Abstract

Image-based Reinforcement Learning is known to suffer from poor sample efficiency and generalisation to unseen visuals such as distractors (task-independent aspects of the observation space). Visual domain randomisation encourages transfer by training over visual factors of variation that may be encountered in the target domain. This increases learning complexity, can negatively impact learning rate and performance, and requires knowledge of potential variations during deployment. In this paper, we introduce Attention-Privileged Reinforcement Learning (APRiL) which uses a self-supervised attention mechanism to significantly alleviate these drawbacks: by focusing on task-relevant aspects of the observations, attention provides robustness to distractors as well as significantly increased learning efficiency. APRiL trains two attention-augmented actor-critic agents: one purely based on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · EEG and Brain-Computer Interfaces · Neural dynamics and brain function