Attention-Privileged Reinforcement Learning
Sasha Salter, Dushyant Rao, Markus Wulfmeier, Raia Hadsell, Ingmar, Posner

TL;DR
This paper introduces APRiL, a reinforcement learning method that uses self-supervised attention to focus on task-relevant features, improving sample efficiency and robustness to distractors without needing privileged information during deployment.
Contribution
We propose a novel attention-augmented reinforcement learning framework that leverages privileged information during training to enhance learning efficiency and robustness, applicable to unseen environments.
Findings
Accelerated learning in diverse domains
Enhanced robustness to distractors
Improved performance outside training distribution
Abstract
Image-based Reinforcement Learning is known to suffer from poor sample efficiency and generalisation to unseen visuals such as distractors (task-independent aspects of the observation space). Visual domain randomisation encourages transfer by training over visual factors of variation that may be encountered in the target domain. This increases learning complexity, can negatively impact learning rate and performance, and requires knowledge of potential variations during deployment. In this paper, we introduce Attention-Privileged Reinforcement Learning (APRiL) which uses a self-supervised attention mechanism to significantly alleviate these drawbacks: by focusing on task-relevant aspects of the observations, attention provides robustness to distractors as well as significantly increased learning efficiency. APRiL trains two attention-augmented actor-critic agents: one purely based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · EEG and Brain-Computer Interfaces · Neural dynamics and brain function
