Think Before You Act: Decision Transformers with Working Memory

Jikun Kang; Romain Laroche; Xingdi Yuan; Adam Trischler; Xue Liu; Jie; Fu

arXiv:2305.16338·cs.LG·May 30, 2024·5 cites

Think Before You Act: Decision Transformers with Working Memory

Jikun Kang, Romain Laroche, Xingdi Yuan, Adam Trischler, Xue Liu, Jie, Fu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a working memory module for Decision Transformers, inspired by human memory, to improve training efficiency and task generalization by mitigating forgetting across multiple tasks.

Contribution

The paper proposes a novel working memory component for Decision Transformers, enhancing multi-task learning and efficiency by reducing forgetting, inspired by human distributed memory systems.

Findings

01

Improved training efficiency in Atari and Meta-World tasks

02

Enhanced generalization across multiple tasks

03

Memory fine-tuning boosts adaptability

Abstract

Decision Transformer-based decision-making agents have shown the ability to generalize across multiple tasks. However, their performance relies on massive data and computation. We argue that this inefficiency stems from the forgetting phenomenon, in which a model memorizes its behaviors in parameters throughout training. As a result, training on a new task may deteriorate the model's performance on previous tasks. In contrast to LLMs' implicit memory mechanism, the human brain utilizes distributed memory storage, which helps manage and organize multiple skills efficiently, mitigating the forgetting phenomenon. Inspired by this, we propose a working memory module to store, blend, and retrieve information for different downstream tasks. Evaluation results show that the proposed method improves training efficiency and generalization in Atari games and Meta-World object manipulation tasks.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

luciferkonn/dt_mem
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Reinforcement Learning in Robotics