AI and the FCI: Can ChatGPT Project an Understanding of Introductory Physics?
Colin G. West

TL;DR
This study evaluates ChatGPT's ability to understand introductory physics concepts using a modified Force Concept Inventory, revealing that ChatGPT4 approaches expert-level performance, with implications for education and AI assessment.
Contribution
It provides a comparative analysis of ChatGPT3.5 and ChatGPT4 on physics conceptual questions, extending prior work and highlighting rapid improvements in AI understanding of physics.
Findings
ChatGPT3.5 matches or exceeds first-semester student performance.
ChatGPT4 approaches expert physicist performance in mechanics.
Performance is uneven and nuanced across different questions.
Abstract
ChatGPT is a groundbreaking ``chatbot"--an AI interface built on a large language model that was trained on an enormous corpus of human text to emulate human conversation. Beyond its ability to converse in a plausible way, it has attracted attention for its ability to competently answer questions from the bar exam and from MBA coursework, and to provide useful assistance in writing computer code. These apparent abilities have prompted discussion of ChatGPT as both a threat to the integrity of higher education and conversely as a powerful teaching tool. In this work we present a preliminary analysis of how two versions of ChatGPT (ChatGPT3.5 and ChatGPT4) fare in the field of first-semester university physics, using a modified version of the Force Concept Inventory (FCI) to assess whether it can give correct responses to conceptual physics questions about kinematics and Newtonian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education · Topic Modeling
MethodsAttentive Walk-Aggregating Graph Neural Network
