Words of Estimative Correlation: Studying Verbalizations of Scatterplots
Rafael Henkin, Cagatay Turkay

TL;DR
This research investigates how viewers verbalize and interpret correlations in scatterplots, revealing vocabulary patterns and ambiguities, to inform multimodal visualization design and improve data analysis communication.
Contribution
It provides a systematic analysis of verbalizations of scatterplots, developing a taxonomy of concepts and highlighting the ambiguity in verbal descriptions of correlations.
Findings
Participants use diverse vocabulary for scatterplots.
Certain concepts are preferred for higher correlations.
Some verbal concepts are ambiguous across different correlations.
Abstract
Natural language and visualization are being increasingly deployed together for supporting data analysis in different ways, from multimodal interaction to enriched data summaries and insights. Yet, researchers still lack systematic knowledge on how viewers verbalize their interpretations of visualizations, and how they interpret verbalizations of visualizations in such contexts. We describe two studies aimed at identifying characteristics of data and charts that are relevant in such tasks. The first study asks participants to verbalize what they see in scatterplots that depict various levels of correlations. The second study then asks participants to choose visualizations that match a given verbal description of correlation. We extract key concepts from responses, organize them in a taxonomy and analyze the categorized responses. We observe that participants use a wide range of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
