Do LLMs Make Mistakes Like Students? Exploring Natural Alignment between   Language Models and Human Error Patterns

Naiming Liu; Shashank Sonkar; Richard G. Baraniuk

arXiv:2502.15140·cs.CL·February 24, 2025

Do LLMs Make Mistakes Like Students? Exploring Natural Alignment between Language Models and Human Error Patterns

Naiming Liu, Shashank Sonkar, Richard G. Baraniuk

PDF

TL;DR

This study investigates how well Large Language Models (LLMs) predict student error patterns in multiple-choice questions, revealing moderate alignment and potential for automated distractor generation to improve educational assessments.

Contribution

The paper provides empirical evidence of the correlation between LLM-generated likelihoods and student distractor choices, highlighting the potential of smaller LLMs for educational applications.

Findings

01

Moderate correlation between LLM likelihoods and student distractor selections.

02

LLMs tend to choose the same common student misconceptions when making mistakes.

03

Smaller LLMs can effectively generate distractors similar to larger models.

Abstract

Large Language Models (LLMs) have demonstrated remarkable capabilities in various educational tasks, yet their alignment with human learning patterns, particularly in predicting which incorrect options students are most likely to select in multiple-choice questions (MCQs), remains underexplored. Our work investigates the relationship between LLM generation likelihood and student response distributions in MCQs with a specific focus on distractor selections. We collect a comprehensive dataset of MCQs with real-world student response distributions to explore two fundamental research questions: (1). RQ1 - Do the distractors that students more frequently select correspond to those that LLMs assign higher generation likelihood to? (2). RQ2 - When an LLM selects a incorrect choice, does it choose the same distractor that most students pick? Our experiments reveals moderate correlations between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus