Linear-Sample Learning of Low-Rank Distributions

Ayush Jain; Alon Orlitsky

arXiv:2010.00064·cs.LG·October 2, 2020

Linear-Sample Learning of Low-Rank Distributions

Ayush Jain, Alon Orlitsky

PDF

Open Access 1 Video

TL;DR

This paper establishes the sample complexity threshold for learning low-rank matrices in latent-variable models and introduces an efficient algorithm that nearly matches this lower bound, improving upon existing spectral methods.

Contribution

It determines the minimal sample size needed for learning low-rank matrices and proposes a nearly optimal, polynomial-time algorithm that advances spectral techniques.

Findings

01

Sample complexity lower bound: kr/^2 samples

02

Proposed algorithm uses kr/^2 rac{r}{}^2 \, rac{kr}{^2}\, rac{kr}{^2}\, ext{samples}

03

Algorithm improves spectral methods and converges rapidly in spectral distance

Abstract

Many latent-variable applications, including community detection, collaborative filtering, genomic analysis, and NLP, model data as generated by low-rank matrices. Yet despite considerable research, except for very special cases, the number of samples required to efficiently recover the underlying matrices has not been known. We determine the onset of learning in several common latent-variable settings. For all of them, we show that learning $k \times k$ , rank- $r$ , matrices to normalized $L_{1}$ distance $ϵ$ requires $Ω (\frac{k r}{ϵ ^{2}})$ samples, and propose an algorithm that uses $O (\frac{k r}{ϵ ^{2}} lo g^{2} \frac{r}{ϵ})$ samples, a number linear in the high dimension, and nearly linear in the, typically low, rank. The algorithm improves on existing spectral techniques and runs in polynomial time. The proofs establish new results on the rapid convergence of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Linear-Sample Learning of Low-Rank Distributions· slideslive

Taxonomy

TopicsMachine Learning and Algorithms · Sparse and Compressive Sensing Techniques · Domain Adaptation and Few-Shot Learning