Belief Memory: Agent Memory Under Partial Observability
Junfeng Liao, Qizhou Wang, Jianing Zhu, Bo Du, Rui Yan, Xiuying Chen

TL;DR
BeliefMem introduces a probabilistic memory system for LLM agents that retains multiple hypotheses with their probabilities, improving decision-making under partial observability.
Contribution
It proposes a novel memory paradigm that stores multiple candidate conclusions with probabilities, addressing the limitations of deterministic memory methods.
Findings
BeliefMem outperforms baselines on LoCoMo and ALFWorld benchmarks.
Probabilistic memory enables better uncertainty management in agents.
Empirical results show significant performance gains with limited data.
Abstract
LLM agents that operate over long context depend on external memory to accumulate knowledge over time. However, existing methods typically store each observation as a single deterministic conclusion (e.g., inferring "API~X failed" from temporary errors), even though such observations are inherently partial and potentially ambiguous. By committing to one conclusion and discarding uncertainty, these methods introduce self-reinforcing error: the agent acts on the stored conclusion, never revisits alternatives, and reinforces the conclusion over time. To address this issue, we propose BeliefMem, which shifts the memory paradigm from committing to a single conclusion per observation to retaining multiple candidate conclusions with their probabilities. Concretely, BeliefMem stores the candidate conclusions as separate memory entries, each carrying a probability that is updated via Noisy-OR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
