Experience Transfer for Multimodal LLM Agents in Minecraft Game
Chenghao Li, Jun Liu, Songbo Zhang, Huadong Jian, Hao Ni, Lik-Hang Lee, Sung-Ho Bae, Guoqing Wang, Yang Yang, and Chaoning Zhang

TL;DR
This paper introduces Echo, a memory framework for multimodal LLM agents in Minecraft, enabling efficient experience transfer and rapid adaptation to new tasks through explicit knowledge decomposition and analogy learning.
Contribution
Echo's novel knowledge decomposition and in-context analogy learning facilitate explicit transfer of experience, improving agent efficiency and adaptability in complex environments.
Findings
Echo achieves 1.3x to 1.7x speed-up on object-unlocking tasks.
Echo exhibits rapid chain-unlocking of multiple items after experience transfer.
Transfer learning accelerates learning in complex interactive environments.
Abstract
Multimodal LLM agents operating in complex game environments must continually reuse past experience to solve new tasks efficiently. In this work, we propose Echo, a transfer-oriented memory framework that enables agents to derive actionable knowledge from prior interactions rather than treating memory as a passive repository of static records. To make transfer explicit, Echo decomposes reusable knowledge into five dimensions: structure, attribute, process, function, and interaction. This formulation allows the agent to identify recurring patterns shared across different tasks and infer what prior experience remains applicable in new situations. Building on this formulation, Echo leverages In-Context Analogy Learning (ICAL) to retrieve relevant experiences and adapt them to unseen tasks through contextual examples. Experiments in Minecraft show that, under a from-scratch learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
