In search of the most efficient and memory-saving visualization of high dimensional data
Bartosz Minch

TL;DR
This paper introduces highly efficient CPU and GPU algorithms for visualizing high-dimensional data by embedding large nearest-neighbor graphs into two dimensions, enabling faster and more memory-efficient exploration of complex datasets.
Contribution
The paper presents novel IVHD algorithms that significantly reduce memory and time requirements for high-quality data embeddings compared to existing methods.
Findings
IVHD algorithms outperform existing methods in speed and memory efficiency.
Embeddings preserve key structural features with minimal quality loss.
Meta-algorithm enables supervised use of unsupervised embedding techniques.
Abstract
Interactive exploration of large, multidimensional datasets plays a very important role in various scientific fields. It makes it possible not only to identify important structural features and forms, such as clusters of vertices and their connection patterns, but also to evaluate their interrelationships in terms of position, distance, shape and connection density. We argue that the visualization of multidimensional data is well approximated by the problem of two-dimensional embedding of undirected nearest-neighbor graphs. The size of complex networks is a major challenge for today's computer systems and still requires more efficient data embedding algorithms. Existing reduction methods are too slow and do not allow interactive manipulation. We show that high-quality embeddings are produced with minimal time and memory complexity. We present very efficient IVHD algorithms (CPU and GPU)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Data Visualization and Analytics · Advanced Clustering Algorithms Research
MethodsBalanced Selection
