Information Structure in Mappings: An Approach to Learning, Representation, and Generalisation
Henry Conklin

TL;DR
This paper develops quantitative methods to analyze the structure of neural network representations, revealing how they learn, generalize, and relate to human cognition across various models and scales.
Contribution
It introduces novel techniques for identifying and quantifying structural primitives in neural mappings, enabling deeper understanding of learning and generalization.
Findings
Structural primitives correlate with model performance
Representation structures evolve during training
Language structures parallel neural network organization
Abstract
Despite the remarkable success of large large-scale neural networks, we still lack unified notation for thinking about and describing their representational spaces. We lack methods to reliably describe how their representations are structured, how that structure emerges over training, and what kinds of structures are desirable. This thesis introduces quantitative methods for identifying systematic structure in a mapping between spaces, and leverages them to understand how deep-learning models learn to represent information, what representational structures drive generalisation, and how design decisions condition the structures that emerge. To do this I identify structural primitives present in a mapping, along with information theoretic quantifications of each. These allow us to analyse learning, structure, and generalisation across multi-agent reinforcement learning models,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConstraint Satisfaction and Optimization
