BOBCAT: Bilevel Optimization-Based Computerized Adaptive Testing
Aritra Ghosh, Andrew Lan

TL;DR
BOBCAT introduces a bilevel optimization framework for computerized adaptive testing that learns data-driven question selection algorithms, outperforming traditional methods in reducing test length across multiple datasets.
Contribution
The paper presents a novel bilevel optimization approach for data-driven question selection in CAT, independent of specific response models, enhancing efficiency and accuracy.
Findings
BOBCAT significantly reduces test length compared to existing methods.
BOBCAT is computationally efficient during adaptive testing.
BOBCAT outperforms traditional CAT methods on five real-world datasets.
Abstract
Computerized adaptive testing (CAT) refers to a form of tests that are personalized to every student/test taker. CAT methods adaptively select the next most informative question/item for each student given their responses to previous questions, effectively reducing test length. Existing CAT methods use item response theory (IRT) models to relate student ability to their responses to questions and static question selection algorithms designed to reduce the ability estimation error as quickly as possible; therefore, these algorithms cannot improve by learning from large-scale student response data. In this paper, we propose BOBCAT, a Bilevel Optimization-Based framework for CAT to directly learn a data-driven question selection algorithm from training data. BOBCAT is agnostic to the underlying student response model and is computationally efficient during the adaptive testing process.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics · Intelligent Tutoring Systems and Adaptive Learning · Educational Technology and Assessment
