Capturing Visualization Design Rationale

Maeve Hutchinson; Radu Jianu; Aidan Slingsby; Jo Wood; Pranava Madhyastha

arXiv:2506.16571·cs.HC·July 2, 2025

TL;DR

This paper introduces a dataset and methodology for probing visualization design rationale through natural language. It draws on literate visualization notebooks written by students in a data visualization course and uses large language models to extract and validate the design rationales they contain.

Contribution

It presents a new dataset derived from real-world student notebooks and a methodology employing large language models to extract and categorize visualization design rationales.

Findings

01. Curated dataset captures student visualization design rationales.

02. Large language models effectively generate and categorize rationale triples.

03. Validated triples ensure high-quality insights into visualization design choices.

Abstract

Prior natural language datasets for data visualization have focused on tasks such as visualization literacy assessment, insight generation, and visualization generation from natural language instructions. These studies often rely on controlled setups with purpose-built visualizations and artificially constructed questions. As a result, they tend to prioritize the interpretation of visualizations, focusing on decoding visualizations rather than understanding their encoding. In this paper, we present a new dataset and methodology for probing visualization design rationale through natural language. We leverage a unique source of real-world visualizations and natural language narratives: literate visualization notebooks created by students as part of a data visualization course. These notebooks combine visual artifacts with design exposition, in which students make explicit the rationale…

Code & Models

Repositories

maevehutch/DesignQAR (official)

Taxonomy

Topics: Data Visualization and Analytics