EventBench: Towards Comprehensive Benchmarking of Event-based MLLMs

Shaoyu Liu; Jianing Li; Guanghui Zhao; Yunjian Zhang; Xiangyang Ji

arXiv:2511.18448·cs.CV·November 25, 2025

EventBench: Towards Comprehensive Benchmarking of Event-based MLLMs

Shaoyu Liu, Jianing Li, Guanghui Zhao, Yunjian Zhang, Xiangyang Ji

PDF

Open Access

TL;DR

EventBench is a comprehensive benchmark for evaluating event-based multimodal large language models, featuring diverse tasks, large-scale datasets, and evaluation of both open and closed-source models to identify current strengths and weaknesses.

Contribution

The paper introduces EventBench, a novel, open, and large-scale benchmark with diverse tasks and datasets for comprehensive assessment of event-based MLLMs.

Findings

01

Current models excel in event stream understanding.

02

Models still struggle with fine-grained recognition.

03

Spatial reasoning remains challenging for existing models.

Abstract

Multimodal large language models (MLLMs) have made significant advancements in event-based vision, yet the comprehensive evaluation of their capabilities within a unified benchmark remains largely unexplored. In this work, we introduce EventBench, a benchmark that offers eight diverse task metrics together with a large-scale event stream dataset. EventBench differs from existing event-based benchmarks in four key aspects: (1) openness in accessibility, releasing all raw event streams and task instructions across eight evaluation metrics; (2) diversity in task coverage, spanning understanding, recognition, and spatial reasoning tasks for comprehensive capability assessment; (3) integration in spatial dimensions, pioneering the design of 3D spatial reasoning tasks for event-based MLLMs; and (4) scale in data volume, with an accompanying training set of over one million event-text pairs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning