Multi-Horizon Representations with Hierarchical Forward Models for   Reinforcement Learning

Trevor McInroe; Lukas Sch\"afer; Stefano V. Albrecht

arXiv:2206.11396·cs.LG·January 30, 2024

Multi-Horizon Representations with Hierarchical Forward Models for Reinforcement Learning

Trevor McInroe, Lukas Sch\"afer, Stefano V. Albrecht

PDF

Open Access 1 Repo

TL;DR

This paper introduces HKSL, a hierarchical multi-step auxiliary task for reinforcement learning from pixels, which improves learning efficiency and representation quality across multiple timescales in robotic control tasks.

Contribution

HKSL is a novel hierarchical approach that learns multi-scale representations using forward models and ensemble critics, addressing temporal challenges in pixel-based RL.

Findings

01

HKSL converges faster to higher or optimal returns than alternative methods.

02

HKSL's representations accurately capture task-relevant details across timescales.

03

Communication between hierarchy levels organizes information effectively, enhancing sample efficiency.

Abstract

Learning control from pixels is difficult for reinforcement learning (RL) agents because representation learning and policy learning are intertwined. Previous approaches remedy this issue with auxiliary representation learning tasks, but they either do not consider the temporal aspect of the problem or only consider single-step transitions, which may cause learning inefficiencies if important environmental changes take many steps to manifest. We propose Hierarchical $k$ -Step Latent (HKSL), an auxiliary task that learns multiple representations via a hierarchy of forward models that learn to communicate and an ensemble of $n$ -step critics that all operate at varying magnitudes of step skipping. We evaluate HKSL in a suite of 30 robotic control tasks with and without distractors and a task of our creation. We find that HKSL either converges to higher or optimal episodic returns more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uoe-agents/hksl
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Control Systems and Identification · Fuzzy Logic and Control Systems