Semantic HELM: A Human-Readable Memory for Reinforcement Learning
Fabian Paischer, Thomas Adler, Markus Hofmarcher, Sepp Hochreiter

TL;DR
This paper introduces Semantic HELM, a human-readable memory system for reinforcement learning agents that uses language tokens to represent past events, improving interpretability and efficiency in memory-dependent tasks.
Contribution
It proposes a novel memory mechanism combining CLIP and pretrained language models to create interpretable, human-readable memory representations for reinforcement learning agents.
Findings
Outperforms baselines on memory-dependent tasks
Converges faster on challenging recognition tasks
Enhances interpretability and troubleshooting of agents
Abstract
Reinforcement learning agents deployed in the real world often have to cope with partially observable environments. Therefore, most agents employ memory mechanisms to approximate the state of the environment. Recently, there have been impressive success stories in mastering partially observable environments, mostly in the realm of computer games like Dota 2, StarCraft II, or MineCraft. However, existing methods lack interpretability in the sense that it is not comprehensible for humans what the agent stores in its memory. In this regard, we propose a novel memory mechanism that represents past events in human language. Our method uses CLIP to associate visual inputs with language tokens. Then we feed these tokens to a pretrained language model that serves the agent as memory and provides it with a coherent and human-readable representation of the past. We train our memory mechanism on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
MethodsContrastive Language-Image Pre-training
