Exploring Multi-dimensional Data via Subset Embedding
Peng Xie, Wenyuan Tao, Jie Li, Wentao Huang, Siming Chen

TL;DR
This paper introduces a novel visual analytics approach using a subset embedding network (SEN) for exploring patterns in multi-dimensional data, enabling efficient and interpretable subset pattern discovery.
Contribution
The paper presents a new subset embedding network design that handles arbitrary subsets and captures feature-based similarities, integrated into a visual system for effective data exploration.
Findings
Effective pattern exploration in multi-dimensional data
High training efficiency due to simple subnet structure
Demonstrated applicability on multiple datasets
Abstract
Multi-dimensional data exploration is a classic research topic in visualization. Most existing approaches are designed for identifying record patterns in dimensional space or subspace. In this paper, we propose a visual analytics approach to exploring subset patterns. The core of the approach is a subset embedding network (SEN) that represents a group of subsets as uniformly-formatted embeddings. We implement the SEN as multiple subnets with separate loss functions. The design enables to handle arbitrary subsets and capture the similarity of subsets on single features, thus achieving accurate pattern exploration, which in most cases is searching for subsets having similar values on few features. Moreover, each subnet is a fully-connected neural network with one hidden layer. The simple structure brings high training efficiency. We integrate the SEN into a visualization system that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Video Analysis and Summarization · Time Series Analysis and Forecasting
