Active Learning to Guide Labeling Efforts for Question Difficulty Estimation
Arthur Thuy, Ekaterina Loginova, Dries F. Benoit

TL;DR
This paper introduces an active learning approach with a novel acquisition function for question difficulty estimation, significantly reducing labeling efforts while maintaining high performance.
Contribution
It proposes PowerVariance, a new acquisition function for active learning in QDE, and demonstrates its effectiveness with less labeled data needed.
Findings
Active learning with PowerVariance achieves near state-of-the-art performance using only 10% of labeled data.
The method reduces labeling efforts significantly compared to fully supervised models.
The approach is applicable to educational and personalized question-answering systems.
Abstract
In recent years, there has been a surge in research on Question Difficulty Estimation (QDE) using natural language processing techniques. Transformer-based neural networks achieve state-of-the-art performance, primarily through supervised methods but with an isolated study in unsupervised learning. While supervised methods focus on predictive performance, they require abundant labeled data. On the other hand, unsupervised methods do not require labeled data but rely on a different evaluation metric that is also computationally expensive in practice. This work bridges the research gap by exploring active learning for QDE, a supervised human-in-the-loop approach striving to minimize the labeling efforts while matching the performance of state-of-the-art models. The active learning process iteratively trains on a labeled subset, acquiring labels from human experts only for the most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Machine Learning and Algorithms
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Softmax · Layer Normalization · WordPiece · Attention Dropout · Dense Connections · Residual Connection · Linear Layer · Adam
