MINED: Probing and Updating with Multimodal Time-Sensitive Knowledge for Large Multimodal Models

Kailin Jiang; Ning Jiang; Yuntao Du; Yuchen Ren; Yuchen Li; Yifan Gao; Jinhe Bi; Yunpu Ma; Bin Li; Lei Liu; Qing Li

arXiv:2510.19457·cs.CL·April 8, 2026

MINED: Probing and Updating with Multimodal Time-Sensitive Knowledge for Large Multimodal Models

Kailin Jiang, Ning Jiang, Yuntao Du, Yuchen Ren, Yuchen Li, Yifan Gao, Jinhe Bi, Yunpu Ma, Bin Li, Lei Liu, Qing Li

PDF

1 Datasets

TL;DR

This paper introduces MINED, a benchmark for evaluating large multimodal models' understanding of time-sensitive knowledge, revealing current limitations and exploring knowledge updating methods.

Contribution

The paper presents MINED, a new benchmark with 2,104 samples to evaluate temporal awareness in LMMs and investigates knowledge editing for updates.

Findings

01

Gemini-2.5-Pro achieves the highest CEM score of 63.07.

02

Most open-source LMMs lack strong time-sensitive knowledge understanding.

03

LMMs can effectively update knowledge via editing in single scenarios.

Abstract

Large Multimodal Models (LMMs) encode rich factual knowledge via cross-modal pre-training, yet their static representations struggle to maintain an accurate understanding of time-sensitive factual knowledge. Existing benchmarks remain constrained by static designs, inadequately evaluating LMMs' ability to understand time-sensitive knowledge. To address this gap, we propose MINED, a comprehensive benchmark that evaluates temporal awareness along 6 key dimensions and 11 challenging tasks: cognition, awareness, trustworthiness, understanding, reasoning, and robustness. MINED is constructed from Wikipedia by two professional annotators, containing 2,104 time-sensitive knowledge samples spanning six knowledge types. Evaluating 15 widely used LMMs on MINED shows that Gemini-2.5-Pro achieves the highest average CEM score of 63.07, while most open-source LMMs still lack time understanding…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

kailinjiang/MINED
dataset· 18 dl
18 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.