ChatGPT and Software Testing Education: Promises & Perils
Sajed Jalil, Suzzana Rafi, Thomas D. LaToza, Kevin Moran, Wing Lam

TL;DR
This paper evaluates ChatGPT's effectiveness in answering software testing questions, revealing moderate accuracy and discussing its implications for education, highlighting both opportunities and risks of AI-assisted learning.
Contribution
It provides an empirical assessment of ChatGPT's performance in a software testing curriculum and explores its potential impact on education practices.
Findings
ChatGPT answers correctly or partially correctly in 55.6% of cases.
It explains answers correctly or partially correctly in 53.0% of cases.
Shared question prompting improves response accuracy slightly.
Abstract
Over the past decade, predictive language modeling for code has proven to be a valuable tool for enabling new forms of automation for developers. More recently, we have seen the advent of general purpose "large language models", based on neural transformer architectures, that have been trained on massive datasets of human written text spanning code and natural language. However, despite the demonstrated representational power of such models, interacting with them has historically been constrained to specific task settings, limiting their general applicability. Many of these limitations were recently overcome with the introduction of ChatGPT, a language model created by OpenAI and trained to operate as a conversational agent, enabling it to answer questions and respond to a wide variety of commands from end users. The introduction of models, such as ChatGPT, has already spurred fervent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Artificial Intelligence in Healthcare and Education · Topic Modeling
