On the Connection Between Learning Two-Layers Neural Networks and Tensor Decomposition
Marco Mondelli, Andrea Montanari

TL;DR
This paper links the difficulty of learning two-layer neural networks with tensor decomposition, showing computational hardness under certain conditions and activation functions, and highlighting tensor methods as fundamental to the problem.
Contribution
It establishes a complexity-theoretic connection between neural network learning and tensor decomposition, providing hardness results for polynomial activations under standard assumptions.
Findings
Polynomial-time algorithms cannot outperform trivial predictors in certain regimes
Tensor decomposition methods are central to learning two-layer networks
Hardness results extend to higher degree activations and non-random weights
Abstract
We establish connections between the problem of learning a two-layer neural network and tensor decomposition. We consider a model with feature vectors , hidden units with weights and output , i.e., , with activation functions given by low-degree polynomials. In particular, if , we prove that no polynomial-time learning algorithm can outperform the trivial predictor that assigns to each example the response variable , when . Our conclusion holds for a `natural data distribution', namely standard Gaussian feature vectors , and output distributed according to a two-layer neural network with random isotropic weights, and under a certain complexity-theoretic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Model Reduction and Neural Networks · Machine Learning and ELM
