Re:Frame -- Retrieving Experience From Associative Memory
Daniil Zelezetsky, Egor Cherepanov, Alexey K. Kovalev, Aleksandr I. Panov

TL;DR
Re:Frame introduces an external associative memory module that retrieves expert experience to significantly enhance offline reinforcement learning performance from scarce and low-quality data without altering the core policy architecture.
Contribution
The paper presents Re:Frame, a novel plug-in module that leverages a small external memory to incorporate expert trajectories into offline RL, improving performance with minimal expert data.
Findings
Re:Frame improves performance on D4RL MuJoCo tasks with as few as 60 expert trajectories.
Using Re:Frame yields up to +10.7 normalized points over baseline.
The method requires no environment interaction or architecture modifications.
Abstract
Offline reinforcement learning (RL) often deals with suboptimal data when collecting large expert datasets is unavailable or impractical. This limitation makes it difficult for agents to generalize and achieve high performance, as they must learn primarily from imperfect or inconsistent trajectories. A central challenge is therefore how to best leverage scarce expert demonstrations alongside abundant but lower-quality data. We demonstrate that incorporating even a tiny amount of expert experience can substantially improve RL agent performance. We introduce Re:Frame (Retrieving Experience From Associative Memory), a plug-in module that augments a standard offline RL policy (e.g., Decision Transformer) with a small external Associative Memory Buffer (AMB) populated by expert trajectories drawn from a separate dataset. During training on low-quality data, the policy learns to retrieve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
