Event-Centric World Modeling with Memory-Augmented Retrieval for Embodied Decision-Making

Zhaowen Fan; Rongchao Zhang

arXiv:2604.07392·cs.LG·May 1, 2026

Event-Centric World Modeling with Memory-Augmented Retrieval for Embodied Decision-Making

Zhaowen Fan, Rongchao Zhang

PDF

TL;DR

This paper introduces an event-centric world modeling framework with memory-augmented retrieval for embodied decision-making, emphasizing interpretability, physical consistency, and real-time operation in dynamic environments.

Contribution

It proposes a novel structured environment representation and retrieval-based decision-making approach that enhances interpretability and physical consistency in autonomous agents.

Findings

01

Operates within real-time control constraints.

02

Maintains interpretable decision-making via case-based reasoning.

03

Ensures maneuvers are consistent with observed dynamics.

Abstract

Autonomous agents operating in dynamic and safety-critical environments require decision-making frameworks that are both computationally efficient and physically grounded. However, many existing approaches rely on end-to-end learning, which often lacks interpretability and explicit mechanisms for ensuring consistency with physical constraints. In this work, we propose an event-centric world modeling framework with memory-augmented retrieval for embodied decision-making. The framework represents the environment as a structured set of semantic events, which are encoded into a permutation-invariant latent representation. Decision-making is performed via retrieval over a knowledge bank of prior experiences, where each entry associates an event representation with a corresponding maneuver. The final action is computed as a weighted combination of retrieved solutions, providing a transparent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.