Evaluating the Capability of Video Question Generation for Expert Knowledge Elicitation

Huaying Zhang; Atsushi Hashimoto; Tosho Hirasawa

arXiv:2512.15006·cs.CV·December 18, 2025

Evaluating the Capability of Video Question Generation for Expert Knowledge Elicitation

Huaying Zhang, Atsushi Hashimoto, Tosho Hirasawa

PDF

Open Access

TL;DR

This paper introduces a new evaluation protocol and dataset for assessing video question generation models based on their ability to elicit expert knowledge, emphasizing question quality over answerability.

Contribution

We propose a novel evaluation protocol and create the EgoExoAsk dataset to better measure question quality in video question generation for expert knowledge elicitation.

Findings

01

Evaluation protocol aligns with question generation settings

02

Models with richer context perform better in the evaluation

03

EgoExoAsk dataset supports effective benchmarking

Abstract

Skilled human interviewers can extract valuable information from experts. This raises a fundamental question: what makes some questions more effective than others? To address this, a quantitative evaluation of question-generation models is essential. Video question generation (VQG) is a topic for video question answering (VideoQA), where questions are generated for given answers. Their evaluation typically focuses on the ability to answer questions, rather than the quality of generated questions. In contrast, we focus on the question quality in eliciting unseen knowledge from human experts. For a continuous improvement of VQG models, we propose a protocol that evaluates the ability by simulating question-answering communication with experts using a question-to-answer retrieval. We obtain the retriever by constructing a novel dataset, EgoExoAsk, which comprises 27,666 QA pairs generated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Mobile Crowdsensing and Crowdsourcing