On the Opportunities of Large Language Models for Programming Process   Data

John Edwards; Arto Hellas; Juho Leinonen

arXiv:2411.00414·cs.CY·November 4, 2024

On the Opportunities of Large Language Models for Programming Process Data

John Edwards, Arto Hellas, Juho Leinonen

PDF

Open Access

TL;DR

This paper explores how large language models can be utilized to analyze programming process data, offering new opportunities for automated feedback and summarization in computing education.

Contribution

It presents a discussion on leveraging LLMs for analyzing programming processes and provides a case study demonstrating automatic summarization and feedback generation.

Findings

01

LLMs can effectively summarize programming processes.

02

LLMs enable automated, formative feedback in programming education.

03

The approach advances automated analysis of programming data.

Abstract

Computing educators and researchers have used programming process data to understand how programs are constructed and what sorts of problems students struggle with. Although such data shows promise for using it for feedback, fully automated programming process feedback systems have still been an under-explored area. The recent emergence of large language models (LLMs) have yielded additional opportunities for researchers in a wide variety of fields. LLMs are efficient at transforming content from one format to another, leveraging the body of knowledge they have been trained with in the process. In this article, we discuss opportunities of using LLMs for analyzing programming process data. To complement our discussion, we outline a case study where we have leveraged LLMs for automatically summarizing the programming process and for creating formative feedback on the programming process.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware System Performance and Reliability · Distributed and Parallel Computing Systems · Topic Modeling