Write, Rank, or Rate: Comparing Methods for Studying Visualization Affordances
Chase Stokes, Kylie Lin, and Cindy Xiong Bearfield

TL;DR
This paper compares different scalable methods for studying visualization affordances, finding that combined ranking and rating approaches can approximate detailed human interpretations, with GPT-4o serving as a partial proxy.
Contribution
It introduces and evaluates four alternative elicitation methods for visualization affordances and explores the use of GPT-4o as a human proxy, highlighting their strengths and limitations.
Findings
Ranking and rating methods can approximate free-response conclusions
Participant bias influences ranking methodologies
GPT-4o performs well as a proxy for salience ratings
Abstract
A growing body of work on visualization affordances highlights how specific design choices shape reader takeaways from information visualizations. However, mapping the relationship between design choices and reader conclusions often requires labor-intensive crowdsourced studies, generating large corpora of free-response text for analysis. To address this challenge, we explored alternative scalable research methodologies to assess chart affordances. We test four elicitation methods from human-subject studies: free response, visualization ranking, conclusion ranking, and salience rating, and compare their effectiveness in eliciting reader interpretations of line charts, dot plots, and heatmaps. Overall, we find that while no method fully replicates affordances observed in free-response conclusions, combinations of ranking and rating methods can serve as an effective proxy at a broad…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics
