Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information
Riashat Islam, Manan Tomar, Alex Lamb, Yonathan Efroni, Hongyu Zang,, Aniket Didolkar, Dipendra Misra, Xin Li, Harm van Seijen, Remi Tachet des, Combes, John Langford

TL;DR
This paper introduces a new offline RL benchmark for environments with rich, irrelevant exogenous visual information and proposes multi-step inverse models to learn effective agent-controller representations, outperforming existing methods.
Contribution
The paper presents a novel offline RL benchmark for exogenous visual information and demonstrates that multi-step inverse models improve representation learning in noisy, complex environments.
Findings
Multi-step inverse models outperform baselines in noisy environments.
Representations learned with ACRO are more robust to irrelevant visual noise.
The benchmark enables studying exogenous information in offline RL.
Abstract
Learning to control an agent from data collected offline in a rich pixel-based visual observation space is vital for real-world applications of reinforcement learning (RL). A major challenge in this setting is the presence of input information that is hard to model and irrelevant to controlling the agent. This problem has been approached by the theoretical RL community through the lens of exogenous information, i.e, any control-irrelevant information contained in observations. For example, a robot navigating in busy streets needs to ignore irrelevant information, such as other people walking in the background, textures of objects, or birds in the sky. In this paper, we focus on the setting with visually detailed exogenous information, and introduce new offline RL benchmarks offering the ability to study this problem. We find that contemporary representation learning techniques can fail…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 epidemiological studies · Reinforcement Learning in Robotics · Neural dynamics and brain function
Methodsfail
