EgoMemReason: A Memory-Driven Reasoning Benchmark for Long-Horizon Egocentric Video Understanding

Ziyang Wang; Yue Zhang; Shoubin Yu; Ce Zhang; Zengqi Zhao; Jaehong Yoon; Hyunji Lee; Gedas Bertasius; Mohit Bansal

arXiv:2605.09874·cs.CV·May 12, 2026

EgoMemReason: A Memory-Driven Reasoning Benchmark for Long-Horizon Egocentric Video Understanding

Ziyang Wang, Yue Zhang, Shoubin Yu, Ce Zhang, Zengqi Zhao, Jaehong Yoon, Hyunji Lee, Gedas Bertasius, Mohit Bansal

PDF

1 Datasets

TL;DR

EgoMemReason is a new benchmark designed to evaluate long-horizon egocentric video understanding through memory-driven reasoning across days, highlighting the challenges and current limitations of models in this domain.

Contribution

It introduces a comprehensive benchmark with diverse memory tasks, evaluates multiple methods, and reveals significant gaps in current models' ability to handle long-term memory in egocentric videos.

Findings

01

Best models achieve only 39.6% accuracy on the benchmark.

02

Memory performance degrades with longer temporal horizons.

03

Different memory types fail for distinct reasons.

Abstract

Next-generation visual assistants, such as smart glasses, embodied agents, and always-on life-logging systems, must reason over an entire day or more of continuous visual experience. In ultra-long video settings, relevant information is sparsely distributed across hours or days, making memory a fundamental challenge: models must accumulate information over time, recall prior states, track temporal order, and abstract recurring patterns. However, existing week-long video benchmarks are primarily designed for perception and recognition, such as moment localization or global summarization, rather than reasoning that requires integrating evidence across multiple days. To address this gap, we introduce EgoMemReason, a comprehensive benchmark that systematically evaluates week-long egocentric video understanding through memory-driven reasoning. EgoMemReason evaluates three complementary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Ted412/EgoMemReason
dataset· 112 dl
112 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.