Do Language Model Agents Align with Humans in Rating Visualizations? An Empirical Study

Zekai Shao; Yi Shan; Yixuan He; Yuxuan Yao; Junhong Wang; Xiaolong (Luke) Zhang; Yu Zhang; and Siming Chen

arXiv:2505.06702·cs.HC·May 13, 2025

Do Language Model Agents Align with Humans in Rating Visualizations? An Empirical Study

Zekai Shao, Yi Shan, Yixuan He, Yuxuan Yao, Junhong Wang, Xiaolong (Luke) Zhang, Yu Zhang, and Siming Chen

PDF

Open Access 1 Repo

TL;DR

This study investigates whether large language model agents can predict human ratings of visualizations, finding they can simulate human feedback effectively when guided by expert confidence, but with limitations in robustness and bias.

Contribution

The paper provides empirical evidence that language model agents can mimic human visualization ratings when guided by expert confidence, highlighting their potential and limitations.

Findings

01

Agents can replicate human reasoning in visualization ratings.

02

Alignment improves with expert confidence levels.

03

Preprocessing and knowledge injection have robustness issues.

Abstract

Large language models encode knowledge in various domains and demonstrate the ability to understand visualizations. They may also capture visualization design knowledge and potentially help reduce the cost of formative studies. However, it remains a question whether large language models are capable of predicting human feedback on visualizations. To investigate this question, we conducted three studies to examine whether large model-based agents can simulate human ratings in visualization tasks. The first study, replicating a published study involving human subjects, shows agents are promising in conducting human-like reasoning and rating, and its result guides the subsequent experimental design. The second study repeated six human-subject studies reported in literature on subjective ratings, but replacing human participants with agents. Consulting with five human experts, this study…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ZekaiShao25/Agents-Ratings-in-VIS-Experiments
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Visualization and Analytics · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)