A Distance-preserving Matrix Sketch

Leland Wilkinson; Hengrui Luo

arXiv:2009.03979·cs.HC·June 6, 2022

TL;DR

This paper introduces two algorithms, one that selects a subset of rows and one that selects a subset of columns of a large rectangular matrix, so that relative distances between points are preserved and visualizations of very large datasets are less likely to mislead.

Contribution

The paper presents new distance-preserving matrix sketch algorithms that improve the fidelity of visualizations by better maintaining original data relationships.

Findings

01 Algorithms outperform traditional methods in preserving distances

02 Effective on both artificial and real datasets

03 Enhance accuracy of large dataset visualizations

Abstract

Visualizing very large matrices involves many formidable problems. Various popular solutions to these problems involve sampling, clustering, projection, or feature selection to reduce the size and complexity of the original task. An important aspect of these methods is how to preserve relative distances between points in the higher-dimensional space after reducing rows and columns to fit in a lower dimensional space. This aspect is important because conclusions based on faulty visual reasoning can be harmful. Judging dissimilar points as similar or similar points as dissimilar on the basis of a visualization can lead to false conclusions. To ameliorate this bias and to make visualizations of very large datasets feasible, we introduce two new algorithms that respectively select a subset of rows and columns of a rectangular matrix. This selection is designed to preserve relative distances…
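One way to make "preserving relative distances" concrete is to compare the ordering of pairwise row distances before and after columns are dropped. The following is a minimal sketch of such a check, not the authors' selection algorithms: the function names (`pairwise_distances`, `rank_correlation`, `column_sketch_fidelity`), the Euclidean metric, and the Spearman-style rank correlation are all assumptions made for illustration.

```python
import numpy as np


def pairwise_distances(X):
    """Return Euclidean distances between all distinct row pairs of X as a 1-D array."""
    diff = X[:, None, :] - X[None, :, :]
    D = np.sqrt((diff ** 2).sum(axis=-1))
    upper = np.triu_indices(X.shape[0], k=1)  # each unordered pair once
    return D[upper]


def rank_correlation(a, b):
    """Spearman-style rank correlation (no tie correction; assumes continuous data)."""
    ranks_a = np.argsort(np.argsort(a))
    ranks_b = np.argsort(np.argsort(b))
    return np.corrcoef(ranks_a, ranks_b)[0, 1]


def column_sketch_fidelity(X, cols):
    """Hypothetical fidelity score: how well distances computed on the
    selected columns reproduce the ordering of the full-matrix distances."""
    return rank_correlation(pairwise_distances(X), pairwise_distances(X[:, cols]))


# Synthetic data, purely illustrative.
rng = np.random.default_rng(0)
X = rng.normal(size=(60, 20))
all_cols = np.arange(X.shape[1])

fidelity_full = column_sketch_fidelity(X, all_cols)       # keeps every column
fidelity_half = column_sketch_fidelity(X, all_cols[:10])  # arbitrary half of the columns
```

A score near 1 means that pairs of points that are far apart in the full matrix remain far apart in the reduced one. A score like this could be used to compare candidate column subsets, whatever procedure produced them.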

Code & Models

Repositories

hrluo/DistancePreservingMatrixSketch (Official)

Taxonomy

Methods: Feature Selection