Data Dialogue with ChatGPT: Using Code Interpreter to Simulate and Analyse Experimental Data
Andrew Low, Z. Yasemin Kalender

TL;DR
This study explores ChatGPT's ability to simulate and analyze physics laboratory data using the Code Interpreter plugin, revealing its potential and limitations in educational laboratory tasks.
Contribution
It demonstrates how ChatGPT, with Code Interpreter, can generate, interpret, and analyze experimental data, highlighting the importance of prompt detail for accurate results.
Findings
ChatGPT can generate realistic lab data and perform data analysis tasks.
The quality of ChatGPT's output depends heavily on prompt specificity.
ChatGPT's data simulation can introduce heteroscedasticity, affecting novice understanding.
Abstract
Artificial Intelligence (AI) has the potential to fundamentally change the educational landscape. So far, much of the physics education research relating to AI has focused on lecture-based assessment and the ability of ChatGPT to answer conceptual surveys and traditional exam-style questions. In this study, we shift the focus by investigating ChatGPT's ability to complete an introductory mechanics laboratory activity by using Code Interpreter, a recent plugin that allows users to generate and analyse data by writing and running Python code `behind the scenes'. By uploading a common `spring constant' lab activity using Code Interpreter, we investigate the ability of ChatGPT to interpret the activity, generate realistic model data, produce a line-fit, and calculate the reduced chi square statistic. By analysing our interactions with ChatGPT, along with the Python code generated by Code…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Scientific Computing and Data Management · Machine Learning and Data Classification
