Data Augmentation for Visualization Design Knowledge Bases

Hyeok Kim; Jeffrey Heer

arXiv:2508.02216·cs.HC·August 5, 2025

Data Augmentation for Visualization Design Knowledge Bases

Hyeok Kim, Jeffrey Heer

PDF

Open Access

TL;DR

This paper introduces data augmentation techniques to expand and improve visualization knowledge bases by generating and labeling new chart pairs, enhancing coverage and recommendation accuracy.

Contribution

It presents novel methods for generating and labeling chart pairs to augment visualization knowledge bases, addressing limitations of existing datasets.

Findings

01

Expanded corpus with thousands of new chart pairs

02

Improved feature coverage in the knowledge base

03

Enhanced chart recommendation performance

Abstract

Visualization knowledge bases enable computational reasoning and recommendation over a visualization design space. These systems evaluate design trade-offs using numeric weights assigned to different features (e.g., binning a variable). Feature weights can be learned automatically by fitting a model to a collection of chart pairs, in which one chart is deemed preferable to the other. To date, labeled chart pairs have been drawn from published empirical research results; however, such pairs are not comprehensive, resulting in a training corpus that lacks many design variants and fails to systematically assess potential trade-offs. To improve knowledge base coverage and accuracy, we contribute data augmentation techniques for generating and labeling chart pairs. We present methods to generate novel chart pairs based on design permutations and by identifying under-assessed features --…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Visualization and Analytics · Design Education and Practice · Manufacturing Process and Optimization