Educational data augmentation in physics education research using ChatGPT
Fabian Kieser, Peter Wulff, Jochen Kuhn, Stefan K\"uchemann

TL;DR
This study explores using ChatGPT to generate synthetic physics concept inventory data, demonstrating its potential to mimic student responses and introduce variability for educational research purposes.
Contribution
It shows that ChatGPT can accurately solve the FCI and generate diverse responses, including misconception-based answers, aiding physics education research.
Findings
ChatGPT accurately solves the FCI.
Prompting for different cohorts yields no response variance.
Prompting for misconceptions introduces realistic response variability.
Abstract
Generative AI technologies such as large language models show novel potentials to enhance educational research. For example, generative large language models were shown to be capable to solve quantitative reasoning tasks in physics and concept tests such as the Force Concept Inventory. Given the importance of such concept inventories for physics education research, and the challenges in developing them such as field testing with representative populations, this study seeks to examine to what extent a generative large language model could be utilized to generate a synthetic data set for the FCI that exhibits content-related variability in responses. We use the recently introduced ChatGPT based on the GPT 4 generative large language model and investigate to what extent ChatGPT could solve the FCI accurately (RQ1) and could be prompted to solve the FCI as-if it were a student belonging to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational Physics and Python Applications · Explainable Artificial Intelligence (XAI)
