Equilibrium Distributions for t-distributed Stochastic Neighbour Embedding

Antonio Auffinger; Daniel Fletcher

arXiv:2304.03727 · math.PR · April 18, 2023 · 1 citation

Open Access

TL;DR

This paper proves that, for n independent, identically distributed inputs, the empirical measure of the t-SNE output converges to an equilibrium distribution, characterized as a solution of a variational problem, under certain assumptions on the input distribution.

Contribution

It provides a theoretical framework for understanding the limiting behavior of t-SNE embeddings as the number of data points grows.

Findings

01. The empirical measures of the t-SNE output converge to an equilibrium distribution.

02. The equilibrium distribution is characterized as a solution of a variational problem.

03. Convergence holds under certain assumptions on the distribution of the inputs.

Abstract

We study the empirical measure of the output of the t-distributed stochastic neighbour embedding algorithm when the initial data is given by n independent, identically distributed inputs. We prove that under certain assumptions on the distribution of the inputs, this sequence of measures converges to an equilibrium distribution, which is described as a solution of a variational problem.

Taxonomy

Topics: Random Matrices and Applications · Markov Chains and Monte Carlo Methods · Stochastic Gradient Optimization Techniques