Semantic HELM: A Human-Readable Memory for Reinforcement Learning

Fabian Paischer; Thomas Adler; Markus Hofmarcher; Sepp Hochreiter

arXiv:2306.09312·cs.LG·October 30, 2023·5 cites

Semantic HELM: A Human-Readable Memory for Reinforcement Learning

Fabian Paischer, Thomas Adler, Markus Hofmarcher, Sepp Hochreiter

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Semantic HELM, a human-readable memory system for reinforcement learning agents that uses language tokens to represent past events, improving interpretability and efficiency in memory-dependent tasks.

Contribution

It proposes a novel memory mechanism combining CLIP and pretrained language models to create interpretable, human-readable memory representations for reinforcement learning agents.

Findings

01

Outperforms baselines on memory-dependent tasks

02

Converges faster on challenging recognition tasks

03

Enhances interpretability and troubleshooting of agents

Abstract

Reinforcement learning agents deployed in the real world often have to cope with partially observable environments. Therefore, most agents employ memory mechanisms to approximate the state of the environment. Recently, there have been impressive success stories in mastering partially observable environments, mostly in the realm of computer games like Dota 2, StarCraft II, or MineCraft. However, existing methods lack interpretability in the sense that it is not comprehensible for humans what the agent stores in its memory. In this regard, we propose a novel memory mechanism that represents past events in human language. Our method uses CLIP to associate visual inputs with language tokens. Then we feed these tokens to a pretrained language model that serves the agent as memory and provides it with a coherent and human-readable representation of the past. We train our memory mechanism on a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ml-jku/helm
pytorchOfficial

Videos

Semantic HELM: A Human-Readable Memory for Reinforcement Learning· slideslive

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning

MethodsContrastive Language-Image Pre-training