Do Language Model Agents Align with Humans in Rating Visualizations? An Empirical Study
Zekai Shao, Yi Shan, Yixuan He, Yuxuan Yao, Junhong Wang, Xiaolong (Luke) Zhang, Yu Zhang, and Siming Chen

TL;DR
This study investigates whether large language model agents can predict human ratings of visualizations, finding they can simulate human feedback effectively when guided by expert confidence, but with limitations in robustness and bias.
Contribution
The paper provides empirical evidence that language model agents can mimic human visualization ratings when guided by expert confidence, highlighting their potential and limitations.
Findings
Agents can replicate human reasoning in visualization ratings.
Alignment improves with expert confidence levels.
Preprocessing and knowledge injection have robustness issues.
Abstract
Large language models encode knowledge in various domains and demonstrate the ability to understand visualizations. They may also capture visualization design knowledge and potentially help reduce the cost of formative studies. However, it remains a question whether large language models are capable of predicting human feedback on visualizations. To investigate this question, we conducted three studies to examine whether large model-based agents can simulate human ratings in visualization tasks. The first study, replicating a published study involving human subjects, shows agents are promising in conducting human-like reasoning and rating, and its result guides the subsequent experimental design. The second study repeated six human-subject studies reported in literature on subjective ratings, but replacing human participants with agents. Consulting with five human experts, this study…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)
