Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation
Jiayi Hong, Christian Seto, Arlen Fan, Ross Maciejewski

TL;DR
This study evaluates the visualization literacy of GPT-4 and Google's Gemini using a modified assessment test, revealing their limited ability to interpret visualizations and reliance on prior knowledge over visual data.
Contribution
It introduces a benchmark for assessing LLMs' visualization literacy and highlights their current limitations in data interpretation tasks.
Findings
LLMs perform below general public levels in visualization literacy.
LLMs rely more on pre-existing knowledge than visual data.
Current LLMs have limited capability in visual data interpretation.
Abstract
In this paper, we assess the visualization literacy of two prominent Large Language Models (LLMs): OpenAI's Generative Pretrained Transformers (GPT), the backend of ChatGPT, and Google's Gemini, previously known as Bard, to establish benchmarks for assessing their visualization capabilities. While LLMs have shown promise in generating chart descriptions, captions, and design suggestions, their potential for evaluating visualizations remains under-explored. Collecting data from humans for evaluations has been a bottleneck for visualization research in terms of both time and money, and if LLMs were able to serve, even in some limited role, as evaluators, they could be a significant resource. To investigate the feasibility of using LLMs in the visualization evaluation process, we explore the extent to which LLMs possess visualization literacy -- a crucial factor for their effective utility…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistics Education and Methodologies · Artificial Intelligence in Law · Data Analysis with R
