MovieCORE: COgnitive REasoning in Movies

Gueter Josmy Faure; Min-Hung Chen; Jia-Fong Yeh; Ying Cheng; Hung-Ting Su; Yung-Hao Tang; Shang-Hong Lai; Winston H. Hsu

arXiv:2508.19026 · cs.CL · September 19, 2025

TL;DR

MovieCORE is a challenging video question answering (VQA) dataset focused on deep cognitive understanding of movies. Its questions are generated with LLMs, and the paper pairs it with an enhancement module that improves model reasoning.

Contribution

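The paper presents MovieCORE, a novel dataset emphasizing deep cognitive questions; an agentic brainstorming approach in which multiple LLMs generate and refine question-answer pairs; and Agentic Choice Enhancement (ACE), a module for enhancing video reasoning capabilities.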

Findings

01 ACE improves reasoning performance by up to 25%.

02 Dataset quality assessed through cognitive tests.

03 Enhanced understanding of model limitations on nuanced movie questions.

Abstract

This paper introduces MovieCORE, a novel video question answering (VQA) dataset designed to probe deeper cognitive understanding of movie content. Unlike existing datasets that focus on surface-level comprehension, MovieCORE emphasizes questions that engage System-2 thinking while remaining specific to the video material. We present an innovative agentic brainstorming approach, utilizing multiple large language models (LLMs) as thought agents to generate and refine high-quality question-answer pairs. To evaluate dataset quality, we develop a set of cognitive tests assessing depth, thought-provocation potential, and syntactic complexity. We also propose a comprehensive evaluation scheme for assessing VQA model performance on deeper cognitive tasks. To address the limitations of existing video-language models (VLMs), we introduce an agentic enhancement module, Agentic Choice Enhancement…

Code & Models

Models

Joslefaure/HERMES (model, Hugging Face)

Datasets

MovieCORE/MovieCORE (dataset, 57 downloads)

Videos

MovieCORE: COgnitive REasoning in Movies (Underline)