Category learning in deep neural networks: Information content and geometry of internal representations
Laurent Bonnasse-Gahot, Jean-Pierre Nadal

TL;DR
This paper develops a theoretical framework linking mutual information, neural representation geometry, and category learning in deep neural networks, showing how optimal learning enhances discrimination near decision boundaries.
Contribution
It extends neuroscience-inspired models to artificial networks, revealing how mutual information maximization shapes neural representations and boundary sensitivity during category learning.
Findings
Maximizing mutual information leads to optimal neural projections.
Neural Fisher information aligns with category boundaries after learning.
Eigenvectors of Fisher information identify most discriminant directions.
Abstract
In humans and other animals, category learning enhances discrimination between stimuli close to the category boundary. This phenomenon, called categorical perception, was also empirically observed in artificial neural networks trained on classification tasks. In previous modeling works based on neuroscience data, we show that this expansion/compression is a necessary outcome of efficient learning. Here we extend our theoretical framework to artificial networks. We show that minimizing the Bayes cost (mean of the cross-entropy loss) implies maximizing the mutual information between the set of categories and the neural activities prior to the decision layer. Considering structured data with an underlying feature space of small dimension, we show that maximizing the mutual information implies (i) finding an appropriate projection space, and, (ii) building a neural representation with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
