Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation

Jiayi Hong; Christian Seto; Arlen Fan; Ross Maciejewski

arXiv:2501.16277 · cs.PF · January 28, 2025 · 3 cites

TL;DR

This study evaluates the visualization literacy of GPT-4 and Google's Gemini using a modified assessment test, revealing their limited ability to interpret visualizations and reliance on prior knowledge over visual data.

Contribution

It introduces a benchmark for assessing LLMs' visualization literacy and highlights their current limitations in data interpretation tasks.

Findings

01. LLMs perform below general public levels in visualization literacy.

02. LLMs rely more on pre-existing knowledge than visual data.

03. Current LLMs have limited capability in visual data interpretation.

Abstract

In this paper, we assess the visualization literacy of two prominent Large Language Models (LLMs): OpenAI's Generative Pretrained Transformers (GPT), the backend of ChatGPT, and Google's Gemini, previously known as Bard, to establish benchmarks for assessing their visualization capabilities. While LLMs have shown promise in generating chart descriptions, captions, and design suggestions, their potential for evaluating visualizations remains under-explored. Collecting data from humans for evaluations has been a bottleneck for visualization research in terms of both time and money, and if LLMs were able to serve, even in some limited role, as evaluators, they could be a significant resource. To investigate the feasibility of using LLMs in the visualization evaluation process, we explore the extent to which LLMs possess visualization literacy -- a crucial factor for their effective utility…

Code & Models

Repositories

vaderasu/llm4viz-experiments (Official)

Taxonomy

Topics: Statistics Education and Methodologies · Artificial Intelligence in Law · Data Analysis with R