Structure from Local Optima: Learning Subspace Juntas via Higher Order   PCA

Santosh S. Vempala; Ying Xiao

arXiv:1108.3329·cs.CC·April 17, 2012·23 cites

Structure from Local Optima: Learning Subspace Juntas via Higher Order PCA

Santosh S. Vempala, Ying Xiao

PDF

Open Access

TL;DR

This paper introduces a tensor-based extension of PCA to identify relevant subspaces in high-dimensional data, enabling learning of complex functions like k-juntas and intersections of halfspaces.

Contribution

It presents a spectral tensor algorithm that generalizes PCA to higher moments, effectively learning k-subspace juntas under Gaussian irrelevant attributes.

Findings

01

Algorithm identifies relevant subspace with complexity T(k,ε)+poly(n).

02

Generalizes learning low-dimensional concepts to high-dimensional settings.

03

Exploits local optima structure of higher moment tensors.

Abstract

We present a generalization of the well-known problem of learning k-juntas in R^n, and a novel tensor algorithm for unraveling the structure of high-dimensional distributions. Our algorithm can be viewed as a higher-order extension of Principal Component Analysis (PCA). Our motivating problem is learning a labeling function in R^n, which is determined by an unknown k-dimensional subspace. This problem of learning a k-subspace junta is a common generalization of learning a k-junta (a function of k coordinates in R^n) and learning intersections of k halfspaces. In this context, we introduce an irrelevant noisy attributes model where the distribution over the "relevant" k-dimensional subspace is independent of the distribution over the (n-k)-dimensional "irrelevant" subspace orthogonal to it. We give a spectral tensor algorithm which identifies the relevant subspace, and thereby learns…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBlind Source Separation Techniques · Machine Learning and Algorithms · Tensor decomposition and applications