Can LLMs Identify Gaps and Misconceptions in Students' Code   Explanations?

Priti Oli; Rabin Banjade; Andrew M. Olney; Vasile Rus

arXiv:2501.10365·cs.CY·January 22, 2025

Can LLMs Identify Gaps and Misconceptions in Students' Code Explanations?

Priti Oli, Rabin Banjade, Andrew M. Olney, Vasile Rus

PDF

Open Access

TL;DR

This paper explores how different large language model techniques, including prompting, fine-tuning, and preference alignment, can effectively identify gaps and misconceptions in students' code explanations, with GPT-4 showing superior performance.

Contribution

The study compares multiple LLM approaches for identifying misconceptions in student explanations, highlighting the effectiveness of fine-tuning and preference optimization methods.

Findings

01

GPT-4 outperforms LLaMA3 and Mistral in gap detection.

02

Fine-tuned LLMs are more effective than zero-shot prompting.

03

Preference optimization (ORPO) surpasses supervised fine-tuning in accuracy.

Abstract

This paper investigates various approaches using Large Language Models (LLMs) to identify gaps and misconceptions in students' self-explanations of specific instructional material, in our case explanations of code examples. This research is a part of our larger effort to automate the assessment of students' freely generated responses, focusing specifically on their self-explanations of code examples during activities related to code comprehension. In this work, we experiment with zero-shot prompting, Supervised Fine-Tuning (SFT), and preference alignment of LLMs to identify gaps in students' self-explanation. With simple prompting, GPT-4 consistently outperformed LLaMA3 and Mistral in identifying gaps and misconceptions, as confirmed by human evaluations. Additionally, our results suggest that fine-tuned large language models are more effective at identifying gaps in students'…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational Assessment and Pedagogy · Natural Language Processing Techniques