Attribute-based Explanations of Non-Linear Embeddings of High-Dimensional Data
Jan-Tobias Sohns, Michaela Schmitt, Fabian Jirasek, Hans Hasse, and, Heike Leitte

TL;DR
This paper introduces NoLiES, a novel interactive visualization tool that enhances understanding of non-linear high-dimensional data embeddings through attribute-based explanations and algebraic topology insights.
Contribution
The paper presents a new augmentation method called rangesets and integrates it into NoLiES for improved interpretability of non-linear embeddings.
Findings
Rangesets enable quick detection of data structure and outliers.
NoLiES effectively handles complex attribute distributions and large datasets.
Case studies demonstrate improved understanding of latent features in thermodynamics data.
Abstract
Embeddings of high-dimensional data are widely used to explore data, to verify analysis results, and to communicate information. Their explanation, in particular with respect to the input attributes, is often difficult. With linear projects like PCA the axes can still be annotated meaningfully. With non-linear projections this is no longer possible and alternative strategies such as attribute-based color coding are required. In this paper, we review existing augmentation techniques and discuss their limitations. We present the Non-Linear Embeddings Surveyor (NoLiES) that combines a novel augmentation strategy for projected data (rangesets) with interactive analysis in a small multiples setting. Rangesets use a set-based visualization approach for binned attribute values that enable the user to quickly observe structure and detect outliers. We detail the link between algebraic topology…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Data Visualization and Analytics · Cell Image Analysis Techniques
MethodsPrincipal Components Analysis
