Fundamental Limits of Learning High-dimensional Simplices in Noisy Regimes
Seyed Amir Hossein Saberi, Amir Najafi, Abolfazl Motahari, Babak H. khalaj

TL;DR
This paper establishes sample complexity bounds for learning high-dimensional simplices from noisy data, providing new theoretical limits and algorithms that perform near optimally under certain noise conditions.
Contribution
The paper introduces new bounds for simplex learning in noisy regimes, extending previous work with a Fourier-based recovery method and matching lower bounds in specific SNR regimes.
Findings
Sample complexity scales as (K^2/ε^2) e^{O(K/SNR^2)} for high-dimensional simplices.
Lower bounds show at least Ω(K^3 σ^2/ε^2 + K/ε) samples are needed in noisy settings.
When SNR ≥ Ω(√K), the noisy case complexity matches the noiseless case.
Abstract
In this paper, we establish sample complexity bounds for learning high-dimensional simplices in from noisy data. Specifically, we consider i.i.d. samples uniformly drawn from an unknown simplex in , each corrupted by additive Gaussian noise of unknown variance. We prove an algorithm exists that, with high probability, outputs a simplex within or total variation (TV) distance at most from the true simplex, provided , where is the signal-to-noise ratio. Extending our prior work~\citep{saberi2023sample}, we derive new information-theoretic lower bounds, showing that simplex estimation within TV distance requires at least samples, where denotes the noise variance. In the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Markov Chains and Monte Carlo Methods
