Active Learning to Guide Labeling Efforts for Question Difficulty   Estimation

Arthur Thuy; Ekaterina Loginova; Dries F. Benoit

arXiv:2409.09258·cs.LG·October 11, 2024

Active Learning to Guide Labeling Efforts for Question Difficulty Estimation

Arthur Thuy, Ekaterina Loginova, Dries F. Benoit

PDF

Open Access 1 Repo

TL;DR

This paper introduces an active learning approach with a novel acquisition function for question difficulty estimation, significantly reducing labeling efforts while maintaining high performance.

Contribution

It proposes PowerVariance, a new acquisition function for active learning in QDE, and demonstrates its effectiveness with less labeled data needed.

Findings

01

Active learning with PowerVariance achieves near state-of-the-art performance using only 10% of labeled data.

02

The method reduces labeling efforts significantly compared to fully supervised models.

03

The approach is applicable to educational and personalized question-answering systems.

Abstract

In recent years, there has been a surge in research on Question Difficulty Estimation (QDE) using natural language processing techniques. Transformer-based neural networks achieve state-of-the-art performance, primarily through supervised methods but with an isolated study in unsupervised learning. While supervised methods focus on predictive performance, they require abundant labeled data. On the other hand, unsupervised methods do not require labeled data but rely on a different evaluation metric that is also computationally expensive in practice. This work bridges the research gap by exploring active learning for QDE, a supervised human-in-the-loop approach striving to minimize the labeling efforts while matching the performance of state-of-the-art models. The active learning process iteratively trains on a labeled subset, acquiring labels from human experts only for the most…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

arthur-thuy/qde-active-learning
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Machine Learning and Algorithms

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Softmax · Layer Normalization · WordPiece · Attention Dropout · Dense Connections · Residual Connection · Linear Layer · Adam