Automatic Scatterplot Design Optimization for Clustering Identification
Ghulam Jilani Quadri, Jennifer Adorno Nieves, Brenton M. Wiernik, Paul, Rosen

TL;DR
This paper introduces an automatic optimization tool for scatterplot design that enhances clustering visibility by adjusting visual parameters, validated through user studies to improve data understanding.
Contribution
It presents a novel framework using merge trees to automatically optimize scatterplot visual encodings for better cluster detection.
Findings
Efficiently generates high-quality scatterplots from large parameter spaces.
Improves clustering visibility in scatterplots.
Validated with user and case studies.
Abstract
Scatterplots are among the most widely used visualization techniques. Compelling scatterplot visualizations improve understanding of data by leveraging visual perception to boost awareness when performing specific visual analytic tasks. Design choices in scatterplots, such as graphical encodings or data aspects, can directly impact decision-making quality for low-level tasks like clustering. Hence, constructing frameworks that consider both the perceptions of the visual encodings and the task being performed enables optimizing visualizations to maximize efficacy. In this paper, we propose an automatic tool to optimize the design factors of scatterplots to reveal the most salient cluster structure. Our approach leverages the merge tree data structure to identify the clusters and optimize the choice of subsampling algorithm, sampling rate, marker size, and marker opacity used to generate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Analysis with R · Advanced Clustering Algorithms Research · Data Visualization and Analytics
