Coding schemes in neural networks learning classification tasks
Alexander van Meegen, Haim Sompolinsky

TL;DR
This paper investigates how neural network properties like nonlinearity and weight scaling influence the internal representations, or coding schemes, that emerge during supervised learning of classification tasks, revealing different schemes in linear and nonlinear networks.
Contribution
It provides a Bayesian analysis of neural network coding schemes, showing how nonlinearity and scaling affect emergent feature representations in classification tasks.
Findings
Linear networks develop analog coding schemes.
Nonlinear networks exhibit symmetry breaking leading to sparse or redundant coding.
Network properties significantly influence emergent representations.
Abstract
Neural networks posses the crucial ability to generate meaningful representations of task-dependent features. Indeed, with appropriate scaling, supervised learning in neural networks can result in strong, task-dependent feature learning. However, the nature of the emergent representations, which we call the `coding scheme', is still unclear. To understand the emergent coding scheme, we investigate fully-connected, wide neural networks learning classification tasks using the Bayesian framework where learning shapes the posterior distribution of the network weights. Consistent with previous findings, our analysis of the feature learning regime (also known as `non-lazy', `rich', or `mean-field' regime) shows that the networks acquire strong, data-dependent features. Surprisingly, the nature of the internal representations depends crucially on the neuronal nonlinearity. In linear networks,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Scientific Research Methods · Neural Networks and Applications · Advanced Research in Systems and Signal Processing
