Sample Complexity Bounds for Learning High-dimensional Simplices in Noisy Regimes
Amir Hossein Saberi, Amir Najafi, Seyed Abolfazl Motahari, Babak H., Khalaj

TL;DR
This paper establishes sample complexity bounds for learning high-dimensional simplices from noisy data, demonstrating that under certain SNR conditions, the complexity matches the noiseless case, and introduces a Fourier-based recovery technique.
Contribution
It provides the first theoretical bounds for learning simplices in noisy regimes and introduces a novel Fourier analysis method for distribution recovery.
Findings
Sample complexity bound: $n \,\ge\, (K^2/\varepsilon^2) e^{\Omega(K/\mathrm{SNR}^2)}$
As long as SNR ≥ Ω(√K), noisy and noiseless sample complexities are comparable
Proposes a Fourier-based technique for recovering distributions from Gaussian noise
Abstract
In this paper, we find a sample complexity bound for learning a simplex from noisy samples. Assume a dataset of size is given which includes i.i.d. samples drawn from a uniform distribution over an unknown simplex in , where samples are assumed to be corrupted by a multi-variate additive Gaussian noise of an arbitrary magnitude. We prove the existence of an algorithm that with high probability outputs a simplex having a distance of at most from the true simplex (for any ). Also, we theoretically show that in order to achieve this bound, it is sufficient to have samples, where stands for the signal-to-noise ratio. This result solves an important open problem and shows as long as , the sample…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStatistical Methods and Inference · Machine Learning and Algorithms · Bayesian Methods and Mixture Models
