ConfusionBench: An Expert-Validated Benchmark for Confusion Recognition and Localization in Educational Videos

Lu Dong; Xiao Wang; Mark Frank; Srirangaraj Setlur; Venu Govindaraju; Ifeoma Nwogu

arXiv:2603.17267·cs.CV·March 19, 2026

ConfusionBench: An Expert-Validated Benchmark for Confusion Recognition and Localization in Educational Videos

Lu Dong, Xiao Wang, Mark Frank, Srirangaraj Setlur, Venu Govindaraju, Ifeoma Nwogu

PDF

Open Access

TL;DR

ConfusionBench is a high-quality, expert-validated benchmark dataset for recognizing and localizing student confusion in educational videos, enabling more reliable and fine-grained educational AI analysis.

Contribution

The paper introduces a multi-stage filtering pipeline to create ConfusionBench, a new benchmark dataset with expert validation for confusion recognition and localization in educational videos.

Findings

01

Proprietary model outperforms open-source in overall confusion recognition

02

Open-source model is more conservative, prone to missed detections

03

Student confusion report visualization aids educational intervention decisions

Abstract

Recognizing and localizing student confusion from video is an important yet challenging problem in educational AI. Existing confusion datasets suffer from noisy labels, coarse temporal annotations, and limited expert validation, which hinder reliable fine-grained recognition and temporally grounded analysis. To address these limitations, we propose a practical multi-stage filtering pipeline that integrates two stages of model-assisted screening, researcher curation, and expert validation to build a higher-quality benchmark for confusion understanding. Based on this pipeline, we introduce ConfusionBench, a new benchmark for educational videos consisting of a balanced confusion recognition dataset and a video localization dataset. We further provide zero-shot baseline evaluations of a representative open-source model and a proprietary model on clip-level confusion recognition, long-video…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Intelligent Tutoring Systems and Adaptive Learning · Explainable Artificial Intelligence (XAI)