Does Multiple Choice Have a Future in the Age of Generative AI? A   Posttest-only RCT

Danielle R. Thomas; Conrad Borchers; Sanjit Kakarla; Jionghao Lin,; Shambhavi Bhushan; Boyuan Guo; Erin Gatz; Kenneth R. Koedinger

arXiv:2412.10267·cs.HC·December 16, 2024

Does Multiple Choice Have a Future in the Age of Generative AI? A Posttest-only RCT

Danielle R. Thomas, Conrad Borchers, Sanjit Kakarla, Jionghao Lin,, Shambhavi Bhushan, Boyuan Guo, Erin Gatz, Kenneth R. Koedinger

PDF

1 Repo

TL;DR

This study compares multiple-choice questions and open-response tasks in learning, finding MCQs as effective and more time-efficient, with GPT models aiding in grading, thus questioning the future dominance of open responses.

Contribution

It provides empirical evidence on MCQ effectiveness relative to open responses and introduces GPT-based autograding methods for open responses.

Findings

01

No significant difference in learning outcomes across conditions.

02

MCQ condition required less completion time.

03

GPT models effectively autograded open responses for low-stakes assessment.

Abstract

The role of multiple-choice questions (MCQs) as effective learning tools has been debated in past research. While MCQs are widely used due to their ease in grading, open response questions are increasingly used for instruction, given advances in large language models (LLMs) for automated grading. This study evaluates MCQs effectiveness relative to open-response questions, both individually and in combination, on learning. These activities are embedded within six tutor lessons on advocacy. Using a posttest-only randomized control design, we compare the performance of 234 tutors (790 lesson completions) across three conditions: MCQ only, open response only, and a combination of both. We find no significant learning differences across conditions at posttest, but tutors in the MCQ condition took significantly less time to complete instruction. These findings suggest that MCQs are as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cmu-plus/lak2025-advocacy
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Cosine Annealing · Residual Connection · Attention Dropout · Linear Warmup With Cosine Annealing · Discriminative Fine-Tuning · Weight Decay · Softmax