Emblaze: Illuminating Machine Learning Representations through Interactive Comparison of Embedding Spaces
Venkatesh Sivaraman, Yiwei Wu, Adam Perer

TL;DR
Emblaze is an interactive visualization tool that helps machine learning practitioners compare and analyze high-dimensional embedding spaces to identify model flaws and improve representations.
Contribution
We introduce Emblaze, a novel system integrating interactive visualization and analysis techniques for comparing embedding spaces within computational notebooks.
Findings
Enables visual comparison of embedding spaces with an animated scatter plot.
Provides dynamic neighborhood analysis and clustering to highlight interesting changes.
Case studies show improved insights into embedding space structures.
Abstract
Modern machine learning techniques commonly rely on complex, high-dimensional embedding representations to capture underlying structure in the data and improve performance. In order to characterize model flaws and choose a desirable representation, model builders often need to compare across multiple embedding spaces, a challenging analytical task supported by few existing tools. We first interviewed nine embedding experts in a variety of fields to characterize the diverse challenges they face and techniques they use when analyzing embedding spaces. Informed by these perspectives, we developed a novel system called Emblaze that integrates embedding space comparison within a computational notebook environment. Emblaze uses an animated, interactive scatter plot with a novel Star Trail augmentation to enable visual comparison. It also employs novel neighborhood analysis and clustering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Data Analysis with R · Species Distribution and Climate Change
