MovieCORE: COgnitive REasoning in Movies
Gueter Josmy Faure, Min-Hung Chen, Jia-Fong Yeh, Ying Cheng, Hung-Ting Su, Yung-Hao Tang, Shang-Hong Lai, Winston H. Hsu

TL;DR
MovieCORE introduces a challenging VQA dataset focused on deep cognitive understanding of movies, utilizing LLMs for question generation and an enhancement module to improve reasoning in AI models.
Contribution
The paper presents MovieCORE, a novel dataset emphasizing deep cognitive questions, and an agentic approach with LLMs and ACE to enhance video reasoning capabilities.
Findings
ACE improves reasoning performance by up to 25%.
Dataset quality assessed through cognitive tests.
Enhanced understanding of model limitations on nuanced movie questions.
Abstract
This paper introduces MovieCORE, a novel video question answering (VQA) dataset designed to probe deeper cognitive understanding of movie content. Unlike existing datasets that focus on surface-level comprehension, MovieCORE emphasizes questions that engage System-2 thinking while remaining specific to the video material. We present an innovative agentic brainstorming approach, utilizing multiple large language models (LLMs) as thought agents to generate and refine high-quality question-answer pairs. To evaluate dataset quality, we develop a set of cognitive tests assessing depth, thought-provocation potential, and syntactic complexity. We also propose a comprehensive evaluation scheme for assessing VQA model performance on deeper cognitive tasks. To address the limitations of existing video-language models (VLMs), we introduce an agentic enhancement module, Agentic Choice Enhancement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
