Automatic Scatterplot Design Optimization for Clustering Identification

Ghulam Jilani Quadri; Jennifer Adorno Nieves; Brenton M. Wiernik; Paul; Rosen

arXiv:2207.03355·cs.HC·July 8, 2022

Automatic Scatterplot Design Optimization for Clustering Identification

Ghulam Jilani Quadri, Jennifer Adorno Nieves, Brenton M. Wiernik, Paul, Rosen

PDF

Open Access

TL;DR

This paper introduces an automatic optimization tool for scatterplot design that enhances clustering visibility by adjusting visual parameters, validated through user studies to improve data understanding.

Contribution

It presents a novel framework using merge trees to automatically optimize scatterplot visual encodings for better cluster detection.

Findings

01

Efficiently generates high-quality scatterplots from large parameter spaces.

02

Improves clustering visibility in scatterplots.

03

Validated with user and case studies.

Abstract

Scatterplots are among the most widely used visualization techniques. Compelling scatterplot visualizations improve understanding of data by leveraging visual perception to boost awareness when performing specific visual analytic tasks. Design choices in scatterplots, such as graphical encodings or data aspects, can directly impact decision-making quality for low-level tasks like clustering. Hence, constructing frameworks that consider both the perceptions of the visual encodings and the task being performed enables optimizing visualizations to maximize efficacy. In this paper, we propose an automatic tool to optimize the design factors of scatterplots to reveal the most salient cluster structure. Our approach leverages the merge tree data structure to identify the clusters and optimize the choice of subsampling algorithm, sampling rate, marker size, and marker opacity used to generate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Analysis with R · Advanced Clustering Algorithms Research · Data Visualization and Analytics