Sliding down the stairs: how correlated latent variables accelerate learning with neural networks
Lorenzo Bardone, Sebastian Goldt

TL;DR
This paper demonstrates that correlations between latent variables in neural network inputs significantly accelerate learning from higher-order statistical features, providing analytical thresholds and empirical validation for this effect.
Contribution
It reveals how latent variable correlations speed up extraction of higher-order features, offering analytical thresholds and empirical evidence for hierarchical learning mechanisms.
Findings
Correlations between latent variables reduce sample complexity for learning higher-order features.
Analytical thresholds for learning directions are derived and validated.
Hierarchical learning mechanisms are unveiled through simulations.
Abstract
Neural networks extract features from data using stochastic gradient descent (SGD). In particular, higher-order input cumulants (HOCs) are crucial for their performance. However, extracting information from the th cumulant of -dimensional inputs is computationally hard: the number of samples required to recover a single direction from an order- tensor (tensor PCA) using online SGD grows as , which is prohibitive for high-dimensional inputs. This result raises the question of how neural networks extract relevant directions from the HOCs of their inputs efficiently. Here, we show that correlations between latent variables along the directions encoded in different input cumulants speed up learning from higher-order correlations. We show this effect analytically by deriving nearly sharp thresholds for the number of samples required by a single neuron to weakly-recover…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Stochastic Gradient Descent
