Beyond Generating Code: Evaluating GPT on a Data Visualization Course
Chen Zhu-Tian, Chenyang Zhang, Qianwen Wang, Jakob Troidl, Simon, Warchol, Johanna Beyer, Nils Gehlenborg, Hanspeter Pfister

TL;DR
This study empirically evaluates GPT-3.5 and GPT-4's performance in a data visualization course, assessing their abilities beyond code generation in tasks like interpretation, design, and insight communication.
Contribution
It provides a comprehensive evaluation of GPT's capabilities in visualization tasks, highlighting strengths, limitations, and implications for visualization education.
Findings
GPT-4 scored 80% on course assessments
GPT can assist in data cleanup and visualization interaction
Humans can distinguish GPT-generated work with 70% accuracy
Abstract
This paper presents an empirical evaluation of the performance of the Generative Pre-trained Transformer (GPT) model in Harvard's CS171 data visualization course. While previous studies have focused on GPT's ability to generate code for visualizations, this study goes beyond code generation to evaluate GPT's abilities in various visualization tasks, such as data interpretation, visualization design, visual data exploration, and insight communication. The evaluation utilized GPT-3.5 and GPT-4 to complete assignments of CS171, and included a quantitative assessment based on the established course rubrics, a qualitative analysis informed by the feedback of three experienced graders, and an exploratory study of GPT's capabilities in completing border visualization tasks. Findings show that GPT-4 scored 80% on quizzes and homework, and TFs could distinguish between GPT- and human-generated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Analysis with R · Data Visualization and Analytics · Scientific Computing and Data Management
