Structure from Local Optima: Learning Subspace Juntas via Higher Order PCA
Santosh S. Vempala, Ying Xiao

TL;DR
This paper introduces a tensor-based extension of PCA to identify relevant subspaces in high-dimensional data, enabling learning of complex functions like k-juntas and intersections of halfspaces.
Contribution
It presents a spectral tensor algorithm that generalizes PCA to higher moments, effectively learning k-subspace juntas under Gaussian irrelevant attributes.
Findings
Algorithm identifies relevant subspace with complexity T(k,ε)+poly(n).
Generalizes learning low-dimensional concepts to high-dimensional settings.
Exploits local optima structure of higher moment tensors.
Abstract
We present a generalization of the well-known problem of learning k-juntas in R^n, and a novel tensor algorithm for unraveling the structure of high-dimensional distributions. Our algorithm can be viewed as a higher-order extension of Principal Component Analysis (PCA). Our motivating problem is learning a labeling function in R^n, which is determined by an unknown k-dimensional subspace. This problem of learning a k-subspace junta is a common generalization of learning a k-junta (a function of k coordinates in R^n) and learning intersections of k halfspaces. In this context, we introduce an irrelevant noisy attributes model where the distribution over the "relevant" k-dimensional subspace is independent of the distribution over the (n-k)-dimensional "irrelevant" subspace orthogonal to it. We give a spectral tensor algorithm which identifies the relevant subspace, and thereby learns…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlind Source Separation Techniques · Machine Learning and Algorithms · Tensor decomposition and applications
