SMMBench: A Benchmark for Source-Distributed Multimodal Agent Memory

Huacan Chai; Yukai Wang; Yingxuan Yang; Dan Peng; Yuanyi Song; Zhihui Fu; Weiwen Liu; Jianghao Lin; Jun Wang; Weinan Zhang

arXiv:2605.15710·cs.CL·May 18, 2026

SMMBench: A Benchmark for Source-Distributed Multimodal Agent Memory

Huacan Chai, Yukai Wang, Yingxuan Yang, Dan Peng, Yuanyi Song, Zhihui Fu, Weiwen Liu, Jianghao Lin, Jun Wang, Weinan Zhang

PDF

1 Repo

TL;DR

SMMBench is a new benchmark designed to evaluate multimodal agents' ability to retrieve and reason over evidence scattered across multiple heterogeneous sources, addressing a key gap in current multimodal memory research.

Contribution

The paper introduces SMMBench, a benchmark for source-distributed multimodal memory reasoning, highlighting its importance and providing initial baseline evaluations.

Findings

01

Current systems struggle with source-distributed multimodal reasoning

02

SMMBench contains 1877 samples from 264 sources

03

Baseline models show significant room for improvement in this task

Abstract

Existing benchmarks for multimodal memory reasoning largely evaluate systems within pre-assembled contexts, but under-evaluate whether agents can use evidence distributed across independently originated sources. We argue that source-distributed memory composition is an important and under-examined bottleneck in multimodal agent memory, especially when relevant evidence is fragmented across heterogeneous artifacts such as conversations, profiles, screenshots, tables, images, and documents. To address this gap, we introduce Source-distributed Multimodal Memory Benchmark(SMMBench), which measures whether agents can retrieve, align, and compose multimodal evidence scattered across multiple sources rather than reason within a single curated context. SMMBench evaluates four core capabilities: (1) cross-source multimodal reasoning; (2) conflict resolution; (3) preference reasoning; (4)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://huggingface.co/datasets/HuacanChai/SMMBench
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.