Contrastive analysis for scatterplot-based representations of dimensionality reduction
Wilson E. Marc\'ilio-Jr, Danilo M. Eler, Rog\'erio E. Garcia

TL;DR
This paper presents a contrastive analysis methodology for scatterplot-based dimensionality reduction, enabling detailed cluster interpretation and feature influence exploration, especially focusing on differences among clusters.
Contribution
It introduces a novel contrastive analysis approach and bipartite graph visualization for better understanding cluster formation in DR scatterplots, addressing limitations of global-focused methods.
Findings
Effective in exploring cluster differences in case studies
Robust quantitative evaluation supports multidimensional analysis
Enhances interpretability of feature influence on clusters
Abstract
Cluster interpretation after dimensionality reduction (DR) is a ubiquitous part of exploring multidimensional datasets. DR results are frequently represented by scatterplots, where spatial proximity encodes similarity among data samples. In the literature, techniques support the understanding of scatterplots' organization by visualizing the importance of the features for cluster definition with layout enrichment strategies. However, current approaches usually focus on global information, hampering the analysis whenever the focus is to understand the differences among clusters. Thus, this paper introduces a methodology to visually explore DR results and interpret clusters' formation based on contrastive analysis. We also introduce a bipartite graph to visually interpret and explore the relationship between the statistical variables employed to understand how the data features influence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
