Jaynes Machine: The universal microstructure of deep neural networks
Venkat Venkatasubramanian, N. Sanjeevrajan, Manasi Khandekar

TL;DR
This paper introduces a universal microstructure theory for deep neural networks, predicting a lognormal distribution of connection strengths across layers, supported by empirical data from six large-scale networks.
Contribution
It develops a novel theoretical framework called statistical teleodynamics to explain the microstructure of deep neural networks and predicts universal properties of their connection distributions.
Findings
Connection strengths follow a lognormal distribution.
Universal microstructure predicted for all layers.
Empirical validation across six large networks.
Abstract
We present a novel theory of the microstructure of deep neural networks. Using a theoretical framework called statistical teleodynamics, which is a conceptual synthesis of statistical thermodynamics and potential game theory, we predict that all highly connected layers of deep neural networks have a universal microstructure of connection strengths that is distributed lognormally (). Furthermore, under ideal conditions, the theory predicts that and are the same for all layers in all networks. This is shown to be the result of an arbitrage equilibrium where all connections compete and contribute the same effective utility towards the minimization of the overall loss function. These surprising predictions are shown to be supported by empirical data from six large-scale deep neural networks in real life. We also discuss how these results can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Thermodynamics and Statistical Mechanics · Statistical Mechanics and Entropy · Quantum many-body systems
