The Prevalence of Neural Collapse in Neural Multivariate Regression
George Andriopoulos, Zixuan Dong, Li Guo, Zifan Zhao, Keith Ross

TL;DR
This paper demonstrates that neural collapse phenomena also occur in multivariate regression tasks, supported by empirical evidence and theoretical modeling, suggesting neural collapse may be a universal behavior in deep learning.
Contribution
It introduces Neural Regression Collapse (NRC), extending neural collapse concepts to regression, with empirical validation and a theoretical explanation via the Unconstrained Feature Model.
Findings
NRC occurs in various datasets and architectures.
NRC involves feature vectors collapsing to target and weight subspaces.
Regularization is crucial for the emergence of NRC.
Abstract
Recently it has been observed that neural networks exhibit Neural Collapse (NC) during the final stage of training for the classification problem. We empirically show that multivariate regression, as employed in imitation learning and other applications, exhibits Neural Regression Collapse (NRC), a new form of neural collapse: (NRC1) The last-layer feature vectors collapse to the subspace spanned by the principal components of the feature vectors, where is the dimension of the targets (for univariate regression, ); (NRC2) The last-layer feature vectors also collapse to the subspace spanned by the last-layer weight vectors; (NRC3) The Gram matrix for the weight vectors converges to a specific functional form that depends on the covariance matrix of the targets. After empirically establishing the prevalence of (NRC1)-(NRC3) for a variety of datasets and network architectures,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNeural Networks and Applications
