TL;DR
SONG is a novel incremental dimensionality reduction method that outperforms existing techniques like t-SNE and UMAP in visualizing evolving datasets, especially with heterogeneous data, noise, and high variance.
Contribution
We introduce SONG, a parametric nonlinear dimensionality reduction technique enabling incremental data visualization with robustness to noise and heterogeneous data.
Findings
SONG outperforms Parametric t-SNE, t-SNE, and UMAP in incremental visualization tasks.
SONG shows significant improvements in cluster quality on real datasets like MNIST and Fashion MNIST.
SONG demonstrates greater noise tolerance compared to UMAP and t-SNE.
Abstract
Non-parametric dimensionality reduction techniques, such as t-SNE and UMAP, are proficient in providing visualizations for datasets of fixed sizes. However, they cannot incrementally map and insert new data points into an already provided data visualization. We present Self-Organizing Nebulous Growths (SONG), a parametric nonlinear dimensionality reduction technique that supports incremental data visualization, i.e., incremental addition of new data while preserving the structure of the existing visualization. In addition, SONG is capable of handling new data increments, no matter whether they are similar or heterogeneous to the already observed data distribution. We test SONG on a variety of real and simulated datasets. The results show that SONG is superior to Parametric t-SNE, t-SNE and UMAP in incremental data visualization. Specifically, for heterogeneous increments, SONG improves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest
