Efficient Algorithms for t-distributed Stochastic Neighborhood Embedding
George C. Linderman, Manas Rachh, Jeremy G. Hoskins, Stefan, Steinerberger, Yuval Kluger

TL;DR
This paper introduces FIt-SNE, a significantly faster t-SNE implementation using FFT acceleration, along with methods for better cluster visualization and handling large datasets with limited memory.
Contribution
The paper presents FIt-SNE, a novel FFT-accelerated t-SNE algorithm, and introduces out-of-core PCA and late exaggeration techniques for improved scalability and interpretability.
Findings
FIt-SNE dramatically reduces t-SNE computation time.
Out-of-core PCA enables t-SNE on large datasets with limited memory.
Late exaggeration improves cluster detection in embeddings.
Abstract
t-distributed Stochastic Neighborhood Embedding (t-SNE) is a method for dimensionality reduction and visualization that has become widely popular in recent years. Efficient implementations of t-SNE are available, but they scale poorly to datasets with hundreds of thousands to millions of high dimensional data-points. We present Fast Fourier Transform-accelerated Interpolation-based t-SNE (FIt-SNE), which dramatically accelerates the computation of t-SNE. The most time-consuming step of t-SNE is a convolution that we accelerate by interpolating onto an equispaced grid and subsequently using the fast Fourier transform to perform the convolution. We also optimize the computation of input similarities in high dimensions using multi-threaded approximate nearest neighbors. We further present a modification to t-SNE called "late exaggeration," which allows for easier identification of clusters…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Face and Expression Recognition · Sparse and Compressive Sensing Techniques
MethodsConvolution
